New Biological Entities and the Pharmaceutical or Diagnostic Use Thereof

The present invention provides methods for the treatment of a disease by 
applying a medicament comprising a protease with a defined specificity that is
capable of hydrolyzing specific peptide bonds within a target substrate related to
such disease. The proteases with such a defined specificity can further be used 
for related therapeutic or diagnostic purposes. 

Background 

Academic and industrial research continuously searches for functional proteins to 
be used as therapeutic, research, diagnostic, nutritional, personal care or 
industrial agents. Today, such functional proteins can be classified mainly into 
two categories: natural proteins and engineered proteins. Natural proteins, on 
the one hand, are discovered from nature, e.g. by screening natural isolates or 
by sequencing genomes from diverse species. Engineered proteins, on the other 
hand, are typically based on known proteins and are altered in order to acquire 
modified functionalities. The present invention discloses engineered proteins with 
novel functions as compared to the starting components. Such proteins are 
called NBEs (New Biologic Entities). The NBEs disclosed in the present invention 
are engineered enzymes with novel substrate specificities or fusion proteins of 
such engineered enzymes with other functional components. 

Specificity is an essential element of enzyme function. A cell consists of 
thousands of different, highly reactive catalysts. Yet the cell is able to maintain a 
coordinated metabolism and a highly organized three-dimensional structure. This 
is due in part to the specificity of enzymes, i.e. the selective conversion of their 
respective substrates. Specificity is a qualitative and a quantitative property: the 
specificity of a particular enzyme can vary widely, ranging from just one 
particular type of target molecules to all molecular types with certain chemical 
substructures. In nature, the specificity of an organism's enzymes has been 
evolved to the particular needs of the organism. Arbitrary specificities with high 
value for therapeutic, research, diagnostic, nutritional or industrial applications 
are unlikely to be found in any organism's enzymatic repertoire due to the large 
space of possible specificities. The only realistic way of obtaining such
specificities is their generation de novo. 

When comparing enzymes with binders, a paradigm of specificity is given by 
antibodies recognizing individual epitopes as small distinct structures within large 
molecules. The naturally occurring vast range of antibody specificities is 
attributed to the diversity generated by the immune system combined with 
natural selection. Several mechanisms contribute to the vast repertoire of 
antibody specificity and occur at different stages of immune response generation 
and antibody maturation (Janeway, C. et al. (1999) Immunobiology, Elsevier
Science Ltd., Garland Publishing, New York). Specifically, antibodies contain 
complementarity determining regions (CDRs) which interact with the antigen in a 
highly specific manner and allow discrimination even between very similar 
epitopes. The light as well as the heavy chain of the antibody each contribute 
three CDRs to the binding domain. Nature uses recombination of various gene 
segments combined with further mutagenesis in the generation of CDRs. As a 
result, the sequences of the six CDR loops are highly variable in composition and 
length and this forms the basis for the diversity of binding specificities in 
antibodies. A similar principle for the generation of a diversity of catalytic 
specificities is not known from nature. 

Catalysis, i.e. the increase of the rate of a specific chemical reaction, is, besides
binding, the most important protein function. Catalytic proteins, i.e. enzymes, are
classified according to the chemical reaction they catalyze. 

Transferases are enzymes transferring a group, for example, the methyl group or 
a glycosyl group, from one compound (generally regarded as donor) to another 
compound (generally regarded as acceptor). For example, glycosyltransferases 
(EC 2.4) transfer glycosyl residues from a donor to an acceptor molecule. Some 
of the glycosyltransferases also catalyze hydrolysis, which can be regarded as 
transfer of a glycosyl group from the donor to water. The subclass is further 
subdivided into hexosyltransferases (EC 2.4.1), pentosyltransferases (EC 2.4.2)
and those transferring other glycosyl groups (EC 2.4.99, Nomenclature
Committee of the International Union of Biochemistry and Molecular Biology
(NC-IUBMB)).

Oxidoreductases catalyze oxidoreductions. The substrate that is oxidized is
regarded as hydrogen or electron donor. Oxidoreductases are classified as 
dehydrogenases, oxidases, mono- and dioxygenases. Dehydrogenases transfer 
hydrogen from a hydrogen donor to a hydrogen acceptor molecule. Oxidases 
react with molecular oxygen as hydrogen acceptor and produce oxidized products 
as well as either hydrogen peroxide or water. Monooxygenases transfer one
oxygen atom from molecular oxygen to the substrate while the other is reduced to
water. In contrast, dioxygenases catalyze the insertion of both oxygen atoms from
molecular oxygen into the substrate. 

Lyases catalyze elimination reactions and thereby generate double bonds or, in
the reverse direction, catalyze additions to double bonds. Isomerases
catalyze intramolecular rearrangements. Ligases catalyze the formation of 
chemical bonds at the expense of ATP consumption. 

Finally, hydrolases are enzymes that catalyze the hydrolysis of chemical bonds 
like C-O or C-N. The E.C. classification for these enzymes generally classifies
them by the nature of the bond hydrolysed and by the nature of the substrate.
Hydrolases such as lipases and proteases play an important role in nature as well
as in technical applications of biocatalysts. Proteases hydrolyse a peptide bond
within the context of an oligo- or polypeptide. Depending on the catalytic
mechanism, proteases are grouped into aspartic, serine, cysteine, metallo- and
threonine proteases (Handbook of proteolytic enzymes. (1998) Eds: Barrett, A.;
Rawlings, N.; Woessner, J.; Academic Press, London). This classification is based
on the amino acid side chains that are responsible for catalysis and which are 
typically presented in the active site in very similar orientation to each other. The 
scissile bond of the substrate is brought into register with the catalytic residues 
due to specific interactions between the amino acid side chains of the substrate 
and complementary regions of the protease (Perona, J. & Craik, C (1995) Protein 
Science, 4, 337-360). The residues on the N- and C-terminal side of the scissile
bond are usually called P1, P2, P3 etc. and P1', P2', P3', and the binding pockets
complementary to the substrate S1, S2, S3 and S1', S2', S3', respectively
(nomenclature according to Schechter & Berger, Biochem. Biophys. Res.
Commun. 27 (1967) 157-162). The selectivity of proteases can vary widely from
being virtually nonselective - e.g. the subtilisins - over a strict preference at the
P1 position - e.g. trypsin selectively cutting on the C-terminal side of arginine or
lysine residues - to highly specific proteases - e.g. human tissue-type
plasminogen activator (t-PA) cleaving at the C-terminal side of the arginine in
the sequence CPGRVVG (Ding, L. et al. (1995) Proc. Natl. Acad. Sci. USA 92,
7627-7631; Coombs, G. et al. (1996) J. Biol. Chem. 271, 4461-4467).
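
The difference between a P1-only preference and a motif-level specificity can be pictured with a short sketch. The following Python snippet is an illustration only and is not part of the disclosed method: it models a trypsin-like rule (cleavage on the C-terminal side of arginine or lysine) and a t-PA-like rule (cleavage after the arginine of CPGRVVG) as simple string matching. The function names and the example peptide are hypothetical, and real proteases are influenced by neighbouring residues and structure in ways this model ignores.

```python
def trypsin_like_sites(peptide):
    """Return positions i where the bond between peptide[i-1] and peptide[i]
    would be cut, i.e. where the P1 residue peptide[i-1] is R or K."""
    return [i for i in range(1, len(peptide)) if peptide[i - 1] in "RK"]


def motif_sites(peptide, motif="CPGRVVG", p1_offset=4):
    """Return cut positions for a protease that requires a whole motif.
    p1_offset counts the motif residues up to and including P1 (the R)."""
    sites = []
    start = peptide.find(motif)
    while start != -1:
        sites.append(start + p1_offset)
        start = peptide.find(motif, start + 1)
    return sites


example = "AKTLCPGRVVGGSRA"  # hypothetical peptide, for illustration only
print(trypsin_like_sites(example))  # several sites: every R or K counts
print(motif_sites(example))         # only the site inside the full motif
```

In this toy peptide the P1-only rule yields three candidate bonds, whereas the motif rule yields a single one, which mirrors the range of selectivities described above.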

The specificity of proteases, i.e. their ability to recognize and hydrolyze 
preferentially certain peptide substrates, can be expressed qualitatively and 
quantitatively. Qualitative specificity refers to the kind of amino acid residues 
that are accepted by a protease at certain positions of the peptide substrate. For 
example, trypsin and t-PA are related with respect to their qualitative specificity, 
since both of them require an arginine or a similar residue at the P1 position. On
the other hand, quantitative specificity refers to the relative number of peptide
substrates that are accepted as substrates by the protease, or more precisely, to
the relative kcat/KM ratios of the protease for the different peptides that are
accepted by the protease. Proteases that accept only a small portion of all 
possible peptides have a high specificity, whereas the specificity of proteases 
that, as an extreme, cleave any peptide substrate would theoretically be zero. 
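
As a rough illustration of the quantitative notion, the sketch below normalizes the kcat/KM values of one protease across a panel of peptides and reports the fraction of the panel that is accepted. All numbers, the peptide labels and the acceptance threshold are hypothetical and chosen only for illustration; the text above does not prescribe a particular threshold or formula.

```python
def relative_efficiencies(kcat_over_km):
    """Scale each kcat/KM value to the best substrate in the panel (best = 1.0)."""
    best = max(kcat_over_km.values())
    if best <= 0:
        raise ValueError("at least one peptide must have a positive kcat/KM")
    return {pep: value / best for pep, value in kcat_over_km.items()}


def accepted_fraction(kcat_over_km, threshold=0.01):
    """Fraction of peptides whose relative kcat/KM reaches the threshold.
    The threshold is an assumed cut-off, not a value from the description."""
    rel = relative_efficiencies(kcat_over_km)
    accepted = [pep for pep, r in rel.items() if r >= threshold]
    return len(accepted) / len(rel)


# Hypothetical kcat/KM values in M^-1 s^-1 for four peptides.
panel = {"pep_A": 2.0e6, "pep_B": 4.0e4, "pep_C": 1.0e2, "pep_D": 5.0}
print(relative_efficiencies(panel))
print(accepted_fraction(panel))
```

A smaller accepted fraction corresponds to a higher specificity in the sense used above; a protease that accepted every peptide in the panel equally would have an accepted fraction of 1.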

Comparison of the primary, secondary as well as the tertiary structure of 
proteases (Fersht, A., Enzyme Structure and Mechanism, W. H. Freeman and 
Company, New York, 1995) allows identification of classes showing a high degree 
of conservation (Rawlings, N.D. & Barrett, A.J. (1997) In: Proteolysis in Cell
Functions Eds. Hopsu-Havu, V.K.; Jarvinen, M.; Kirschke, H., pp. 13-21, IOS Press,
Amsterdam). A widely accepted scheme for protease classification has been
proposed by Rawlings & Barrett (Handbook of proteolytic enzymes. (1998) Eds:
Barrett, A.; Rawlings, N.; Woessner, J.; Academic Press, London). For example, the
serine protease family can be subdivided into structural classes with
chymotrypsin (class S1), subtilisin (class S8) and carboxypeptidase (class SC)
folds, each of which includes nonspecific as well as specific proteases (Rawlings,
N.D. & Barrett, A.J. (1994) Methods Enzymol. 244, 19-61). This applies to other
protease families analogously. An additional distinction can be made according to 
the relative location of the cleaved bond in the substrate. Carboxy- and 
aminopeptidases cleave amino acids from the C- and N-terminus, respectively,
while endopeptidases cut anywhere along the oligopeptide. 

Many applications would be conceivable if enzymes with a basically unlimited 
spectrum of specificities were available. However, the use of such enzymes with 
high, low or any defined specificity is currently limited to those which can be 
isolated from natural sources. The field of application for these enzymes varies 
from therapeutic, research, diagnostic, nutritional to personal care and industrial 
purposes. 

Enzyme additives in detergents have come to constitute nearly a third of the 
whole industrial enzyme market. Detergent enzymes include proteinases for 
removing organic stains, lipases for removing greasy stains, amylases for 
removing residues of starchy foods and cellulases for restoring the smooth surface
of the fiber. The best-known detergent enzyme is probably the nonspecific
proteinase subtilisin, isolated from various Bacillus species. 

Starch enzymes, such as amylases, account for the majority of enzymes used in food
processing. While starch enzymes include products that are important for textile
desizing, alcohol fermentation, paper and pulp processing, and laundry detergent 
additives, the largest application is for the production of high fructose corn syrup. 
The production of corn syrup from starch by means of industrial enzymes was a 
successful alternative to acid hydrolysis. 

Apart from starch processing, enzymes are used for an increasing range of 
applications in food. Enzymes in food can improve texture, appearance and 
nutritional value or may generate desirable flavours and aromas. Currently used 
food enzymes in bakery are amylase, amyloglycosidases, pentosanases for 
breakdown of pentosan and reduced gluten production or glucose oxidases to 
increase the stability of dough. Common enzymes for dairy are rennet (protease) 
as coagulant in cheese production, lactase for hydrolysis of lactose, protease for 
hydrolysis of whey proteins or catalase for the removal of hydrogen peroxide.
Enzymes used in the brewing process are the above-named amylases, but also
cellulases or proteases to clarify the beer from suspended proteins. In wines and
fruit juices, cloudiness is more commonly caused by starch and pectins so that
amylases and pectinases increase yield and clarification. Papain and other
proteinases are used for meat tenderizing. 

Enzymes have also been developed to aid animals in the digestion of feed. In the 
western hemisphere, corn is a major source of food for cattle, swine, and 
poultry. In order to improve the bioavailability of phosphate from corn, phytase 
is commonly added (Wyss, M. et al., Biochemical characterization of fungal 
phytases (myo-inositol hexakisphosphate phosphohydrolases): Catalytic
properties. Applied & Environmental Microbiology 65, 367-373 (1999)). 
Moreover, phytate hydrolysis has been shown to bring about improvements in 
digestibility of protein and absorption of minerals such as calcium (Bedford, M.R. 
& Schulze, H., Exogenous Enzymes for Pigs and Poultry [Review]. Nutrition 
Research Reviews 11, 91-114 (1998)). Another major feed enzyme is xylanase. 
This enzyme is particularly useful as a supplement for feeding stuff comprising 
more than about 10% of wheat, barley or rye, because of their relatively high
soluble fiber content. Xylanases cause two important actions: reduction of
viscosity of the intestinal contents by hydrolyzing the gel-like high molecular
weight arabinoxylans in feed (Murphy, T. et al., Effect of range of new xylanases
on in vitro viscosity and on performance of broiler diets. British Poultry Science
44, S16-S18 (2003)) and breakdown of polymers in cell walls, which improves the
bioavailability of protein and starch.

Biotech research and development laboratories routinely use special enzymes in 
small quantities along with many other reagents. This use creates a
significant market for various enzymes. Enzymes like alkaline phosphatase,
horseradish peroxidase and luciferase are only some examples. Thermostable 
DNA polymerases like Taq polymerase or restriction endonucleases 
revolutionized laboratory work. 

The use of enzymes in the diagnosis of disease is another important benefit 
derived from the intensive research in biochemistry. Within the past few
years, interest in diagnostic enzymology has increased, and there are still
large areas of medical research in which the diagnostic potential of enzyme 
reactions has not been explored at all. Common enzymes used for clinical 
diagnosis are acid phosphatase, alanine aminotransferase, alkaline phosphatase, 
amylase, angiotensin converting enzymes, aspartate aminotransferase,
cholinesterase, creatine kinase, gamma glutamyltransferase, lactate
dehydrogenase or rennin.

Therapeutic enzymes are a particular class of drugs, categorized by the FDA as 
biologicals, with many advantages compared to other, especially non-biological
pharmaceuticals. Examples of successful therapeutic enzymes are human
clotting factors like factor VIII and factor IX for human treatment. In addition, 
digestive enzymes are used for various deficiencies in human digestive 
processes. Other examples are t-PA and streptokinase for the treatment of 
cardiovascular disease, beta-glucocerebrosidase for the treatment of Type I 
Gaucher disease, L-asparaginase for the treatment of acute lymphoblastic
leukemia and DNAse for the treatment of cystic fibrosis. An important issue in 
the application of proteins as therapeutics is their potential immunogenicity. To 
reduce this risk, one would prefer enzymes of human origin, which narrows down 
the set of available enzymes. The provision of designed enzymes, preferably of 
human origin, with novel, tailor-made specificities would allow the specific 
modification of target substrates at will, while minimizing the risk of 
immunogenicity. A further advantage of highly specific enzymes as therapeutics 
would be their lower risk of side effects. Due to the limited possibility of specific 
interactions between a small molecule and a protein, binding to non-target 
proteins and therefore side effects are quite common and often cause 
termination of an otherwise promising lead compound. Specific enzymes, on the 
other hand, provide many more contact sites and mechanisms for substrate 
discrimination and therefore enable a higher specificity and thereby fewer side
activities.

Proteases represent an important class of therapeutic agents (Drugs of today, 
33, 641-648 (1997)). However, currently the therapeutic protease is usually a 
substitute for insufficient activity of the body's own proteases. For example,
factor VII can be administered in certain cases of coagulation deficiencies of 
bleeders or during surgery (Heuer L; Blumenberg D. (2002) Anaesthesist 
51:388). Tissue-type plasminogen activator (t-PA) is applied in acute cardiac 
infarction, initializing the dissolution of fibrin clots through specific cleavage and 
activation of plasminogen (Verstraete, M. et al. (1995) Drugs, 50, 29-41). So far,
no protease with tailor-made specificity has been generated to provide a
therapeutic agent that specifically activates or inactivates a disease-related
target protein.

Monoclonal antibodies represent another important biological class of substances 
with therapeutic capabilities. Among the main antibody targets are the tumor
necrosis factors (TNFs), which belong to the family of cytokines. TNFs play a
major role in the inflammation process. As homotrimers they can bind to
receptors of nearly every cell. They activate a multiplicity of cellular genes, 
multiple signal transduction mechanisms, kinases and transcription factors. The 
most important TNFs are TNF-alpha and TNF-beta. TNF-alpha is produced by 
macrophages, monocytes and other cells. TNF-alpha is an inflammation 
mediator. Therefore, research of the last decade has been focused on TNF-alpha 
inhibitors like monoclonal antibodies as possible therapeutics for different 
therapeutic indications like Rheumatoid Arthritis, Crohn's disease or Psoriasis 
(Hamilton et al. (2000) Expert Opin Pharmacother, 1 (5): 1041-1052). One of 
the major disadvantages of monoclonal antibodies is their high cost, so that
new biological alternatives are of great importance. 

There are many examples of engineered enzymes in the literature. Fulani et al.
(Fulani F. et al. (2003) Protein Engineering 16, 515-519) describe a rhodanese
(thiosulfate:cyanide sulfurtransferase) from Azotobacter vinelandii which has a
catalytic domain structurally related to the catalytic subunit of Cdc25 phosphatase
enzymes. The difference in catalytic mechanism depends on the different size of
the active site. Both rhodanese and phosphatase are highly specific for different
substrates (sulfate vs. phosphate). The catalytic mechanism of the rhodanese
could be shifted towards serine/threonine phosphatase by single-residue 
insertion. Therefore, Fulani et al. give a single example for the change of a 
catalytic mechanism by structural comparison and sequence alignment of 
naturally known enzymes from different enzyme classes but lack an indication of 
how to generate a user-definable substrate specificity while keeping the same 
catalytic mechanism. 

The thioredoxin reductase described by Briggs et al. (WO 02/090300 A2) has an 
altered cofactor specificity and binds NADPH in preference to NADH.
Thus, both enzymes, the starting point as well as the resulting engineered
enzyme, are highly specific towards different substrates. The methods to achieve
such an altered substrate specificity are either computational processing methods
or sequence alignments of related proteins to define variable and conserved
residues. They all have in common that they are based on the comparison of
structures and sequences of proteins with known specificities followed by the 
transfer of the same to another backbone. 

There are other examples of specificity-engineered enzymes and, in particular, of
proteases which have been published in the literature. None of these examples,
however, provides a means for generating novel specificities compared to the
specificity of the starting material used within the described methods. The
methods range from structure-directed single point mutations (Kurth, T. et al.
(1998) Biochemistry 37, 11434-11440; Ballinger, M. et al. (1996) Biochemistry,
35:13579-13585), exchange of surface loops between two specific proteases
(Horrevoets et al. (1993) J. Biol. Chem. 268, 779-782), to random mutagenesis
either regio-selectively or across the whole gene combined with in-vitro or
in-vivo selection (Sices, H. & Kristie, T. (1998) Proc. Natl. Acad. Sci. USA, 95,
2828-2833).

The rational design of protease specificity is limited to very few examples. This 
approach is severely limited by the insufficient understanding of the complexities 
that govern folding and dynamics as well as structure-function relationships in 
proteins (Corey, M.J. & Corey, E. (1996) Proc. Natl. Acad. Sci. USA, 93:11428-
11434). It is therefore difficult to alter the primary amino acid sequence of a
protease in order to change its activity or specificity in a predictive way. In a
successful example, Kurth et al. engineered trypsin to show a preference for a
dibasic motif (Kurth, T. et al. (1998) Biochemistry, 37:11434-11440). In
another example, Hedstrom et al. converted the S1 substrate specificity of
trypsin to that of chymotrypsin (Hedstrom, L. et al. (1992) Science, 255:1249- 
1253). This is an example where a known property was transferred from one 
backbone to another. 

Ballinger et al. (WO 96/27671) describe subtilisin variants with combination 
mutations (N62D/G166D, and optionally Y104D) having a shift of substrate 
specificity towards peptide or polypeptide substrates with basic amino acids at 
the P1, P2 and P4 positions of the substrate. Suitable substrates of the variant
subtilisin were revealed by sorting a library of phage particles (substrate phage) 
containing five contiguous randomized residues. These subtilisin variants are 
useful for cleaving fusion proteins with basic substrate linkers and processing 
hormones or other proteins (in vitro or in vivo) that contain basic cleavage sites.

The problems associated with rational redesign of enzymes can partially be
overcome by directed evolution (as disclosed in PCT/EP03/04864). These studies
can be classified by their expression and selection systems. Genetic selection 
means to produce inside an organism an enzyme, e.g. a protease, which is able 
to cleave a precursor protein which in turn results in an alteration of the growth 
behavior of the producing organism. From a population of organisms with 
different proteases those can be selected which have an altered growth behavior. 
This principle was for example reported by Davis et al. (US 5258289, WO 
96/21009). The production of a phage system is dependent on the cleavage of a 
phage protein which only can be activated in the presence of a proteolytic 
enzyme which is able to cleave the phage protein. Other approaches use a 
reporter system which allows a selection by screening instead of a genetic 
selection, but also cannot overcome the intrinsic insufficiency of the intracellular 
characterization of enzymes. 

Systems to generate enzymes with altered sequence specificities with self- 
secreting enzymes are also reported. Duff et al. (WO 98/11237) describe an 
expression system for a self-secreting protease. An essential element of the 
experimental design is that the catalytic reaction acts on the protease itself by an 
autoproteolytic processing of the membrane-bound precursor molecule to release 
the matured protease from the cellular membrane into the extracellular 
environment. Therefore, a fusion protein must be constructed where the target 
peptide sequence replaces the natural cleavage site for autoproteolysis. 
Limitations of such a system are that positively identified proteases will have the 
ability to cleave a certain amino acid sequence but they also may cleave many 
other peptide sequences. Therefore, high substrate specificity cannot be 
achieved. Additionally, such a system is not able to control that selected 
proteases cleave at a specific position in a defined amino acid sequence and it 
does not allow a precise characterization of the kinetic constants of the selected 
proteases (kcat, KM).

A method has been described that aims at the generation of new catalytic
activities and specificities within the α/β-barrel proteins (WO 01/42432; Fersht et
al., Methods of producing novel enzymes; Altamirano et al. (2000) Nature 403,
617-622). The α/β-barrel proteins comprise a large superfamily of proteins
accounting for a large fraction of all known enzymes. The structure of the
proteins is made from a β-barrel surrounded by α-helices. The loops connecting
β-strands and helices comprise the so-called lid structure including the active
site residues. The method is based on the classification of α/β-barrel proteins
into two classes based on the catalytic lid structure. An extensive comparison of
α/β-barrel protein structures led the authors to the conclusion that the substrate
binding and specificity is primarily defined by the barrel structure while the 
specificity of the chemical reaction resides within the loops. It is suggested that 
barrels and lid structures from different enzymes can be combined to generate 
new enzymatic activities and to provide a starting point to fine tune the 
properties by targeted or randomized mutagenesis and selection. The method 
does not provide for the generation of user-defined specificity. 

In summary, it is clear that there are many possible applications in the fields of 
therapeutics, research and diagnostics, industrial enzymes, food and feed 
processing, cosmetics and other areas that would become possible by the 
availability of enzymes with a novel substrate specificity. However, only a limited 
number of specific enzymes has been identified from natural sources so far. 
Methods of rational design to modify, alter, convert or transfer sequence 
specificity as well as random approaches described above did not enable the 
generation of a novel and user-definable specificity that was not present in the
employed starting material. 

Therefore, none of the currently available methods can provide enzymes with a 
novel and user-defined sequence specificity. In contrast, the current invention 
provides such enzymes as well as methods for generating them. 

Summary of the Invention 

The objective of the present invention is to provide a method for the treatment 
of a disease by applying a medicament comprising a protease. Further, the
present invention provides engineered proteins with novel functions that do not
exist in the components used for the engineering of such proteins. In particular, 
the invention provides enzymes with user-definable specificities. User-definable 
specificity means that enzymes are provided with specificities that do not exist in 
the components used for the engineering of such enzymes. The specificities can 
be chosen by the user so that one or more intended target substrates are 
preferentially recognised and converted by the enzymes. Furthermore, the 
invention provides enzymes that possess essentially identical sequences to 
human proteins but have different specificities. In a particular embodiment, the 
invention provides proteases with user-definable specificities. 

Furthermore, the present invention is directed to engineered enzymes which are 
fused to one or more further functional components. These further components 
can be proteinaceous components which preferably have binding properties and
are of the group consisting of substrate binding domains, antibodies, receptors or
fragments thereof. Furthermore, these further components can be other
functional components, preferably being selected from the group consisting of
polyethylene glycols, carbohydrates, lipids, fatty acids, nucleic acids, metals, metal
chelates, and fragments or derivatives thereof. The resulting fusion proteins are 
understood as enzymes with user-definable specificities within the present 
invention. 

In addition, the invention is directed to the application of such enzymes with novel,
user-definable specificities for therapeutic, research, diagnostic, nutritional,
personal care or industrial purposes. Moreover, the invention is directed to a
method for generating engineered enzymes with user-definable specificities. In
particular, the invention is directed to generating enzymes that possess essentially
identical sequences to human enzymes but have different specificities. 

This problem has been solved by the embodiments of the invention specified in 
the description below and in the claims. The present invention is thus directed to 
(1) the use of a protease with defined specificity for a target substrate for 
preparing a medicament for the treatment of a specific disease related to said 
target substrate,

(2) an engineered enzyme with defined specificity characterized by the
combination of the following components:

(a) a protein scaffold which catalyzes at least one chemical reaction on at least 
one substrate, and 

(b) one or more specificity determining regions (SDRs) located at sites in the 
protein scaffold that enable the resulting engineered protein to discriminate 
between at least one target substrate and one or more different substrates, and 
wherein the SDRs are essentially synthetic peptide sequences; 

(3) the use of an engineered enzyme as defined in (2) above for therapeutic, 
research, diagnostic, nutritional, personal care or industrial purposes, preferably 
for the use as defined in (1) above;

(4) a method for generating engineered enzymes as defined in (2) above having 
specificities towards target substrates, such specificities not being present in the 
individual starting components, comprising at least the following steps: 

(a) providing a protein scaffold which catalyzes at least one chemical reaction on 
at least one substrate, 

(b) generating a library of engineered enzymes by combining the protein scaffold 
from step (a) with fully or partially random peptide sequences at sites in the 
protein scaffold that enable the resulting engineered enzyme to discriminate 
between at least one target substrate and one or more different substrates, and 

(c) selecting out of the library of engineered enzymes generated in step (b) one 
or more enzymes that have specificities towards at least one target substrate; 

(5) a fusion protein which is comprised of at least one engineered enzyme as 
defined in (2) above and at least one further component, preferably the at least 
one further component having binding properties and more preferably being 
selected from the group consisting of antibodies, binding domains, receptors, and
fragments thereof; 

(6) a composition or pharmaceutical composition comprising one or more 
engineered enzymes as defined in (2) above or a fusion protein as defined in (5) 
above, said pharmaceutical composition may optionally comprise an acceptable 
carrier, excipient and/or auxiliary agent; 

(7) a DNA encoding the engineered enzyme as defined in (2) above; 

(8) a vector comprising the DNA as defined in (7) above;

(9) a host cell or transgenic organism being transformed/transfected with a
vector as defined in (8) above and/or containing the DNA as defined in (7) 
above; and 

(10) a method for producing the engineered enzyme of (2) above comprising 
culturing a cell or organism as defined in (8) above and isolating the enzyme 
from the culture broth. 

Brief description of the Figures 

The following figures are provided in order to explain further the present 
invention in supplement to the detailed description: 

Figure 1 illustrates the three-dimensional structure of human trypsin I with the 
active site residues shown in "ball-and-stick" representation and with the marked 
regions indicating potential SDR insertion sites. 

Figure 2 shows the alignment of the primary amino acid sequence of three 
members of the serine protease class S1 family: human trypsin I, human 
alpha-thrombin and human enteropeptidase (see also SEQ ID NOs: 1, 5 and 6). 

Figure 3 illustrates the three-dimensional structure of subtilisin with the active 
site residues being shown in "ball-and-stick" representation and with the 
numbered regions indicating potential SDR insertion sites. 

Figure 4 shows the alignment of the primary amino acid sequences of four 
members of the serine protease class S8 family: subtilisin E, furin, PC1 and PC5 
(see also SEQ ID NOs: 7-10). 

Figure 5 illustrates the three-dimensional structure of pepsin with the active site 
residues being shown in "ball-and-stick" representation and with the numbered 
regions indicating potential SDR insertion sites. 

Figure 6 shows the alignment of the primary amino acid sequences of three 
members of the A1 aspartic acid protease family: pepsin, β-secretase and 
cathepsin D (see also SEQ ID NOs: 11-13). 

Figure 7 illustrates the three-dimensional structure of caspase 7 with the active 
site residues being shown in "ball-and-stick" representation and with the 
numbered regions indicating potential SDR insertion sites. 

Figure 8 shows the primary amino acid sequence of caspase 7 as a member of 
the cysteine protease class C14 family (see also SEQ ID NO: 14). 

Figure 9 depicts schematically the third aspect of the invention. 

Figure 10 shows a Western blot analysis of a culture supernatant of cells 
expressing variants of human trypsin I with SDR1 and SDR2, compared to 
negative controls. 

Figure 11 shows the time course of the proteolytic cleavage of a target substrate 
by human trypsin I. 

Figure 12 shows the relative activities of three variants of inventive engineered 
proteolytic enzymes in comparison with human trypsin I on two different peptide 
substrates. 

Figure 13 shows the relative specificities of human trypsin I and variants of 
inventive engineered proteolytic enzymes with one or two SDRs, respectively. 

Figure 14 shows the relative specificities of human trypsin I and of variants of 
inventive engineered proteolytic enzymes being specific for human TNF-alpha 
with this scaffold on peptides with a target sequence of human TNF-alpha. 

Figure 15 shows the reduction of cytotoxicity induced by TNF-alpha when 
incubating the TNF-alpha with concentrated supernatant from cultures 
expressing the inventive engineered proteolytic enzymes being specific for 
human TNF-alpha. 

Figure 16 shows the reduction of cytotoxicity induced by TNF-alpha when 
incubating the TNF-alpha with purified inventive engineered proteolytic enzyme 
being specific for human TNF-alpha. 

Figure 17 compares the activity of inventive engineered proteolytic enzymes 
being specific for human TNF-alpha with the activity of human trypsin I on two 
protein substrates: (a) human TNF-alpha; (b) mixture of human serum proteins. 

Figure 18 shows the specific activity of an inventive engineered proteolytic 
enzyme with specificity for human VEGF. 

Definitions 

In the framework of the present invention the following terms and definitions are 
used. 

The term "protease" means any protein molecule that is capable of hydrolysing 
peptide bonds. This includes naturally-occurring or artificial proteolytic enzymes, 
as well as variants thereof obtained by site-directed or random mutagenesis or 
any other protein engineering method, any active fragment of a proteolytic 
enzyme, or any molecular complex or fusion protein comprising one of the 
aforementioned proteins. A "chimera of proteases" means a fusion protein of two 
or more fragments derived from different parent proteases. 

The term "substrate" means any molecule that can be converted catalytically by 
an enzyme. The term "peptide substrate" means any peptide, oligopeptide, or 
protein molecule of any amino acid composition, sequence or length, that 
contains a peptide bond that can be hydrolyzed catalytically by a protease. The 
peptide bond that is hydrolyzed is referred to as the "cleavage site". Numbering 
of positions in the substrate is done according to the system introduced by 
Schechter & Berger (Biochem. Biophys. Res. Commun. 27 (1967) 157-162). 
Amino acid residues adjacent N-terminal to the cleavage site are numbered P1, 
P2, P3, etc., whereas residues adjacent C-terminal to the cleavage site are 
numbered P1', P2', P3', etc. 
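
As an illustration of this numbering, the following minimal Python sketch labels the residues on either side of a cleavage site. The function name label_positions and the example peptide are hypothetical and are used for illustration only.

```python
def label_positions(peptide: str, cleavage_index: int) -> list[tuple[str, str]]:
    """Label residues with Schechter & Berger positions.

    cleavage_index is the number of residues N-terminal to the cleavage
    site, i.e. the hydrolyzed bond lies between peptide[cleavage_index - 1]
    and peptide[cleavage_index].
    """
    if not 0 < cleavage_index < len(peptide):
        raise ValueError("cleavage site must lie between two residues")
    labels = []
    for i, residue in enumerate(peptide):
        if i < cleavage_index:
            # N-terminal side: P1 is directly adjacent to the cleavage site
            labels.append((f"P{cleavage_index - i}", residue))
        else:
            # C-terminal side: P1' is directly adjacent to the cleavage site
            labels.append((f"P{i - cleavage_index + 1}'", residue))
    return labels
```

For a hypothetical hexapeptide AGKSLT cleaved after the third residue, label_positions("AGKSLT", 3) assigns P3, P2, P1 to A, G, K and P1', P2', P3' to S, L, T.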

The term "target substrate" describes a user-defined substrate which is 
specifically recognized and converted by an enzyme according to the invention. 
The term "target peptide substrate" describes a user-defined peptide substrate. 

The term "target specificity" describes the qualitative and quantitative specificity 
of an enzyme that is capable of recognizing and converting a target substrate. 
Catalytic properties of enzymes are expressed using the kinetic parameters "Km" 
or "Michaelis-Menten constant", "kcat" or "catalytic rate constant", and "kcat/Km" 
or "catalytic efficiency", according to the definitions of Michaelis and Menten 
(Fersht, A., Enzyme Structure and Mechanism, W. H. Freeman and Company, 
New York, 1995). The term "catalytic activity" describes quantitatively the 
conversion of a given substrate under defined reaction conditions. 
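
For orientation, these kinetic parameters are related by the Michaelis-Menten rate law, v = kcat [E] [S] / (Km + [S]). The following minimal Python sketch evaluates this rate and the catalytic efficiency kcat/Km. The function names are hypothetical, and any values passed in must use consistent units.

```python
def initial_rate(kcat: float, km: float, enzyme: float, substrate: float) -> float:
    """Michaelis-Menten initial rate v = kcat * [E] * [S] / (Km + [S])."""
    if km <= 0 or enzyme < 0 or substrate < 0:
        raise ValueError("Km must be positive; concentrations must be non-negative")
    return kcat * enzyme * substrate / (km + substrate)


def catalytic_efficiency(kcat: float, km: float) -> float:
    """Catalytic efficiency kcat/Km."""
    if km <= 0:
        raise ValueError("Km must be positive")
    return kcat / km
```

At a substrate concentration equal to Km, the rate is half of its maximum value kcat [E], which is the usual operational reading of Km.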

The term "specificity" means the ability of an enzyme to recognize and convert 
preferentially certain substrates. Specificity can be expressed qualitatively and 
quantitatively. "Qualitative specificity" refers to the chemical nature of the 
substrate residues that are recognized by an enzyme. "Quantitative specificity" 
refers to the number of substrates that are accepted as substrates. Quantitative 
specificity can be expressed by the term s, which is defined as the negative 
logarithm of the number of all accepted substrates divided by the number of all 
possible substrates. Proteases, for example, that accept preferentially a small 
portion of all possible peptide substrates have a "high specificity". Proteases that 
accept almost any peptide substrate have a "low specificity". Definitions are 
made in accordance with WO 03/095670, which is therefore incorporated by 
reference. Proteases with very low specificity are also referred to as "unspecific 
proteases". The term "defined specificity" refers to a certain type of specificity, 
i.e. to a certain target substrate or a set of certain target substrates that are 
preferentially converted versus other substrates. 
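
The quantitative specificity s defined above can be sketched as follows. The text does not state the base of the logarithm; base 10 is assumed here, and the function name quantitative_specificity is hypothetical.

```python
import math


def quantitative_specificity(accepted: int, possible: int) -> float:
    """s = -log(accepted / possible); base 10 is an assumption of this sketch."""
    if not 0 < accepted <= possible:
        raise ValueError("need 0 < accepted <= possible")
    return -math.log10(accepted / possible)
```

Under this assumption, an enzyme that accepts every possible substrate has s = 0 (low specificity), and s grows as the accepted fraction shrinks (high specificity).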

The term "engineered" in combination with the term "enzyme" describes an 
enzyme that is comprised of different components and that has features not 
being conferred by the individual components alone. 

The term "protein scaffold" or "scaffold protein" refers to a variety of primary, 
secondary and tertiary polypeptide structures. 

The term "peptide sequence" indicates any peptide sequence used for insertion 
or substitution into or combination with a protein scaffold. Peptide sequences are 
usually obtained by expression from DNA sequences which can be synthesized 
according to well-established techniques or can be obtained from natural 
sources. Insertion, substitution or combination of peptide sequences with the 
protein scaffold are generated by insertion, substitution or combination of 
oligonucleotides into or with a polynucleotide encoding the protein scaffold. The 
term "synthetic" in combination with the term "peptide sequence" refers to 
peptide sequences that are not present in the protein scaffold in which the 
peptide sequences are inserted or substituted or with which they are combined. 

The term "components" in combination with the term "engineered enzyme" 
refers to peptide or polypeptide sequences that are combined in the engineering 
of such enzymes. Such components may among others comprise one or more 
protein scaffolds and one or more synthetic peptide sequences. The term "library 
of engineered enzymes" describes a mixture of engineered enzymes, whereby 
every single engineered enzyme is encoded by a different polynucleotide 
sequence. The term "gene library" indicates a library of polynucleotides that 
encodes the library of engineered enzymes. The term "SDR" or "Specificity 
determining region" refers to a synthetic peptide sequence that provides the 
defined specificity when combined with the protein scaffold at sites that enable 
the resulting enzymes to discriminate between the target substrate and one or 
more other substrates. Such sites are termed "SDR sites". 

The terms "tertiary structure similar to the structure of" and "similar tertiary 
structure" in combination with the terms "enzyme" or "protein" refer to proteins 
in which the type, sequence, connectivity and relative orientation of the typical 
secondary structural elements of a protein, e.g. alpha-helices, beta-sheets, beta- 
turns and loops, are similar and the proteins are therefore grouped into the same 
structural or topological class or fold. This includes proteins that have altered, 
additional or deleted structural elements of any type but otherwise unchanged 
topology. Examples of such structural classes are the TNF superfamily, the S1 
fold or the S8 fold within the serine proteases, the GPCRs, or the α/β-barrel fold. 

The term "positions that correspond structurally" indicates amino acids in 
proteins of similar tertiary structure that correspond structurally to each other, 
i.e. they are usually located within the same structural or topological element of 
the structure. Within the structural element they possess the same relative 
positions with respect to beginning and end of the structural element. If, e.g. the 
topological comparison of two proteins reveals two structurally corresponding 
sequences of different length, then amino acids within, e.g. 20% and 40% of the 
respective region lengths, correspond to each other structurally. 
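
One simple way to express this relative-position rule is sketched below in Python. The ceiling-based mapping is an assumption made for illustration, since the text only requires equal relative positions within the two regions; the function name corresponding_index is hypothetical.

```python
def corresponding_index(pos_a: int, len_a: int, len_b: int) -> int:
    """Map a 1-based position in region A to region B by relative position.

    The position at fraction pos_a / len_a of region A is mapped to the
    residue covering the same fraction of region B (ceiling convention,
    computed in integer arithmetic to avoid rounding artefacts).
    """
    if len_a < 1 or len_b < 1 or not 1 <= pos_a <= len_a:
        raise ValueError("position must lie within region A")
    return -((-pos_a * len_b) // len_a)
```

With two corresponding regions of 10 and 5 residues, positions 2 and 4 of the longer region (20% and 40% of its length) map to positions 1 and 2 of the shorter region.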

The term "library of engineered enzymes" of the present invention refers to a 
multiplicity of enzymes or enzyme variants, which may exist as a mixture or in 
isolated form. 

Amino acid residues are abbreviated according to the following Table 1 either in 
one- or in three-letter code. 

Table 1: Amino acid abbreviations 

Abbreviations    Amino acid 
A    Ala         Alanine 
C    Cys         Cysteine 
D    Asp         Aspartic acid 
E    Glu         Glutamic acid 
F    Phe         Phenylalanine 
G    Gly         Glycine 
H    His         Histidine 
I    Ile         Isoleucine 
K    Lys         Lysine 
L    Leu         Leucine 
M    Met         Methionine 
N    Asn         Asparagine 
P    Pro         Proline 
Q    Gln         Glutamine 
R    Arg         Arginine 
S    Ser         Serine 
T    Thr         Threonine 
V    Val         Valine 
W    Trp         Tryptophan 
Y    Tyr         Tyrosine 

Detailed description of the invention 

The present invention provides engineered proteins with novel functions. In 
particular, the invention provides enzymes with user-definable specificities. In a 
particular embodiment, the invention provides proteases with user-definable 
specificities. Besides, the invention provides applications of such enzymes with 
novel, user-definable specificities for therapeutic, research, diagnostic, 
nutritional, personal care or industrial purposes. Moreover, the invention 
provides a method for generating enzymes with specificities that are not present 
in the components used for the engineering of such enzymes. In particular, the 
invention is directed to the generation of enzymes that have sequences that are 
essentially identical to mammalian, especially human enzymes but have different 
specificities. Moreover, the invention provides libraries of specific engineered 
enzymes with corresponding specificities encoded genetically, a method for the 
generation of libraries of specific engineered enzymes with corresponding 
specificities encoded genetically, and the application of such libraries for 
technical, diagnostic, nutritional, personal care or research purposes. 

A first aspect of the invention is directed to the application of engineered 
enzymes with specificities for therapeutic, research, diagnostic, nutritional, 
personal care or industrial purposes. The application comprises at least the 
following steps: 

(a) identification of a target peptide substrate whose hydrolysis has a positive 
effect in connection with the intended purpose, such as curing a disease, 
diagnosing a disease, processing of ingredients for human or animal 
nutrition, or other technical processes; 

(b) provision of an engineered enzyme, the enzyme being specific for the 
target peptide identified in step (a); and 

(c) use of the enzyme as provided in step (b) for the intended purpose. 

In a first variant of this aspect of the invention, the engineered enzyme is used 
as a therapeutic means to inactivate a disease-related target substrate. This 
application comprises at least the following steps: 

(a) identification of a target substrate whose function is connected to a 
disease and whose inactivation has a positive effect in connection with the 
disease, and determination of a target site within the target substrate 
characterized by the fact that modification at the target site leads to the 
inactivation of the target substrate; 

(b) provision of an engineered enzyme, the enzyme being specific for the 
target site identified in step (a); and 

(c) use of the enzyme for the inactivation of the target substrate inside or 
outside the human body. 

Preferably, the scaffold is a protease and the modification is hydrolysis of a 
target site in a protein target. Preferably, the hydrolysis leads to the activation or 
inactivation of the peptide or protein target. Potential peptide or protein targets 
include soluble proteins, in particular cytokines, such as proteins of the 
TNF-superfamily, interleukins, interferons, chemokines and growth factors; 
hormones; toxins; enzymes, such as oxidoreductases, transferases, hydrolases, 
lyases, isomerases and ligases; structural proteins, such as collagen; and 
immunoglobulins; or membrane associated proteins, in particular single pass 
transmembrane proteins; multipass transmembrane proteins, such as G-protein 
coupled receptors, ion channels and transporters; lipid-anchored membrane 
proteins and GPI-anchored membrane proteins. 

In a first embodiment of this variant the engineered enzyme is a protease and is 
capable of hydrolysing human tumor necrosis factor-alpha (hTNF-α). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, rheumatoid arthritis, 
inflammatory bowel diseases, psoriasis, Crohn's disease, Ulcerative colitis, 
diabetes type II, classical Hodgkin's Lymphoma (cHL), Grave's disease, 
Hashimoto's thyroiditis, Sjogren's Syndrome, systemic lupus erythematosus, 
multiple sclerosis, Systemic inflammatory response syndrome (SIRS) which leads 
to distant organ damage and multiple organ dysfunction syndrome (MODS), 
eosinophilia, neurodegenerative disease, stroke, closed head injury, encephalitis, 
CNS disorders, asthma, rheumatoid arthritis, sepsis, vasodilation, intravascular 
coagulation and multiple organ failure, as well as other diseases connected with 
hTNF-α. Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hTNF-α (SEQ ID NO:96). More preferably said enzyme or said fusion 
protein is capable of hydrolysing the peptide bonds between positions 31/32, 
32/33, 44/45, 45/46, 87/88, 128/129, 130/131, 140/141 and/or 141/142 (most 
preferred between positions 31/32, 32/33 and/or 45/46) in hTNF-α, or a peptide 
bond in proximity to these positions in hTNF-α, or peptide bonds in protein 
targets related to hTNF-α between positions having structural homology or 
sequence homology to these positions. In this embodiment it is most preferred 
that the protease has the sequence shown in SEQ ID NO:74 or SEQ ID NO:75 
and is capable of hydrolysing hTNF-α at positions 31/32 and/or 32/33. 
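
The position notation used in this and the following embodiments, e.g. 31/32, denotes the peptide bond between two consecutive residues of the referenced sequence. The following minimal Python sketch splits a sequence at such a bond. It assumes 1-based residue numbering, the function name cleave_at_bond is hypothetical, and the sequence in the usage note is an arbitrary toy string, not one of the sequences of the sequence listing.

```python
def cleave_at_bond(sequence: str, bond: str) -> tuple[str, str]:
    """Split a sequence at a bond written as 'n/n+1' (1-based positions)."""
    n_str, c_str = bond.split("/")
    n, c = int(n_str), int(c_str)
    if c != n + 1:
        raise ValueError("a peptide bond joins two consecutive positions")
    if not 1 <= n < len(sequence):
        raise ValueError("bond lies outside the sequence")
    # Residues 1..n form the N-terminal fragment, n+1..end the C-terminal one
    return sequence[:n], sequence[n:]
```

For the toy sequence ACDEFGHIK, cleave_at_bond("ACDEFGHIK", "3/4") returns the fragments ACD and EFGHIK.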

In a second embodiment of this variant the enzyme is a protease and is capable 
of hydrolysing human Tumor necrosis factor ligand superfamily member 5 
(hCD40-L). The enzymes or the fusion protein can thus be used for preparing 
medicaments for the treatment of diseases, such as, but not limited to, systemic 
lupus erythematosus and classical Hodgkin's Lymphoma (cHL), as well as other 
diseases connected with hCD40-L. Preferably, said enzyme or said fusion protein 
is capable of specifically inactivating hCD40-L (SEQ ID NO: 143). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 133/134, 145/146, 165/166, 200/201, 201/202, 207/208 
and/or 216/217 (most preferred between positions 133/134, 165/166, 201/202 
and/or 216/217) in hCD40-L, or a peptide bond in proximity to these positions in 
hCD40-L, or peptide bonds in protein targets related to hCD40-L at positions 
having structural homology or sequence homology to these positions. 

In a third embodiment of this variant the enzyme is a protease and is capable of 
hydrolysing human Macrophage migration inhibitory factor (hMIF). The enzymes 
or the fusion protein can thus be used for preparing medicaments for the 
treatment of diseases, such as, but not limited to, inflammatory diseases, as well 
as other diseases connected with hMIF. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hMIF (SEQ ID NO: 109). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 16/17, 44/45, 66/67, 73/74, 77/78, 88/89, 
92/93 and/or 100/101 (most preferred between positions 16/17 and/or 92/93) in 
hMIF, or a peptide bond in proximity to these positions in hMIF, or peptide bonds 
in protein targets related to hMIF at positions having structural homology or 
sequence homology to these positions. 

In a fourth embodiment of this variant the enzyme is a protease and is capable 
of hydrolysing human Interleukin-1 beta precursor (hIL-1 beta). The enzymes or 
the fusion protein can thus be used for preparing medicaments for the treatment 
of diseases, such as, but not limited to, diabetes, brain inflammation in cancer, 
arthritis, autoimmune and inflammatory diseases, as well as other diseases 
connected with hIL-1 beta. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating hIL-1 beta (SEQ ID NO: 112). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 24/25, 35/36, 46/47, 54/55, 74/75, 75/76, 76/77, 77/78, 
86/87, 88/89, 93/94, 94/95, 97/98 and/or 150/151 (most preferred between 
positions 35/36, 75/76, 76/77, 88/89, 93/94, 94/95 and/or 150/151) in hIL-1 
beta, or a peptide bond in proximity to these positions in hIL-1 beta, or peptide 
bonds in protein targets related to hIL-1 beta at positions having structural 
homology or sequence homology to these positions. 

In a fifth embodiment of this variant the enzyme is a protease and is capable of 
hydrolysing human Interleukin 2 (hIL-2). The enzymes or the fusion protein can 
thus be used for preparing medicaments for the treatment of diseases, such as, 
but not limited to, T-cell leukemia and hairy cell leukemia, Crohn's disease, 
Ulcerative colitis, Grave's disease, Hashimoto's thyroiditis, Sjogren's syndrome, 
systemic lupus erythematosus, multiple sclerosis, asthma, chronic obstructive 
pulmonary disease and classical Hodgkin's Lymphoma (cHL), as well as other 
diseases connected with hIL-2. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating hIL-2 (SEQ ID NO: 99). More preferably said 
enzyme or said fusion protein is capable of hydrolysing the peptide bonds between 
positions 20/21, 32/33, 38/39, 43/44, 45/46, 48/49, 49/50, 54/55, 64/65, 76/77, 
83/84, 84/85, 107/108, 109/110 and/or 120/121 (most preferred between positions 
109/110) in hIL-2, or a peptide bond in proximity to these positions in hIL-2, or 
peptide bonds in protein targets related to hIL-2 at positions having structural 
homology or sequence homology to these positions. 

In a sixth embodiment of this variant the enzyme is a protease and is capable of 
hydrolysing human Interleukin 3 (hIL-3). The enzymes or the fusion protein can 
thus be used for preparing medicaments for the treatment of diseases, such as, 
but not limited to, classical Hodgkin's Lymphoma (cHL) and eosinophilia, as well 
as other diseases connected with hIL-3. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hIL-3 (SEQ ID NO: 148). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 21/22, 28/29, 36/37, 44/45, 46/47, 51/52, 
63/64, 66/67, 79/80, 94/95, 101/102, 108/109 and/or 109/110 (most preferred 
between positions 21/22, 28/29, 46/47, 63/64, 66/67, 79/80 and/or 101/102) in 
hIL-3, or a peptide bond in proximity to these positions in hIL-3, or peptide 
bonds in protein targets related to hIL-3 at positions having structural homology 
or sequence homology to these positions. 

In a seventh embodiment of this variant the enzyme is a protease and is capable 
of hydrolysing human Interleukin 4 (hIL-4). The enzymes or the fusion protein 
can thus be used for preparing medicaments for the treatment of diseases, such 
as, but not limited to, classical Hodgkin's Lymphoma (cHL), Grave's disease, 
Hashimoto's thyroiditis, Sjogren's syndrome, asthma, chronic obstructive 
pulmonary disease and allergic inflammatory reactions, as well as other diseases 
connected with hIL-4. Preferably, said enzyme or said fusion protein is capable of 
specifically inactivating hIL-4 (SEQ ID NO: 118). More preferably said enzyme or 
said fusion protein is capable of hydrolysing the peptide bonds between positions 
4/5, 12/13, 31/32, 37/38, 61/62, 62/63, 64/65, 91/92, 102/103, 121/122 
and/or 126/127 (most preferred between positions 4/5, 61/62, 62/63, 64/65 
and/or 121/122) in hIL-4, or a peptide bond in proximity to these positions in 
hIL-4, or peptide bonds in protein targets related to hIL-4 at positions having 
structural homology or sequence homology to these positions. 

In an eighth embodiment of this variant the enzyme is a protease and is capable 
of hydrolysing human Interleukin-5 (hIL-5). The enzymes or the fusion protein 
can thus be used for preparing medicaments for the treatment of diseases, such 
as, but not limited to, classical Hodgkin's Lymphoma (cHL), asthma, chronic 
obstructive pulmonary disease, eosinophilia, allergic inflammatory diseases, as 
well as other diseases connected with hIL-5. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hIL-5 (SEQ ID NO: 133). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 12/13, 32/33, 67/68, 76/77, 77/78, 80/81, 
83/84, 84/85, 85/86, 90/91, 91/92, 92/93 and/or 98/99 (most preferred 
between positions 90/91, 91/92, 92/93 and/or 98/99) in hIL-5, or a peptide 
bond in proximity to these positions in hIL-5, or peptide bonds in protein targets 
related to hIL-5 at positions having structural homology or sequence homology 
to these positions. 

In a ninth embodiment of this variant the enzyme is a protease and is capable of 
hydrolysing human Interleukin-6 (hIL-6). The enzymes or the fusion protein can 
thus be used for preparing medicaments for the treatment of diseases, such as, 
but not limited to, classical Hodgkin's Lymphoma (cHL), breast cancer, renal cell 
carcinoma, multiple myeloma, lymphoma, leukemia, Grave's disease, 
Hashimoto's thyroiditis, Sjogren's syndrome, systemic lupus erythematosus, 
Systemic inflammatory response syndrome (SIRS) which leads to distant organ 
damage and multiple organ dysfunction syndrome (MODS), chronic obstructive 
pulmonary disease (COPD), Castleman's diseases, inflammatory bowel diseases, 
Crohn's disease, as well as other diseases connected with hIL-6. Preferably, said 
enzyme or said fusion protein is capable of specifically inactivating hIL-6 (SEQ ID 
NO: 134). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 32/33, 35/36, 55/56, 71/72, 
129/130, 130/131, 132/133, 135/136, 141/142, 161/162, 180/181 and/or 
183/184 (most preferred between positions 135/136 and/or 141/142) in hIL-6, 
or a peptide bond in proximity to these positions in hIL-6, or peptide bonds in 
protein targets related to hIL-6 at positions having structural homology or 
sequence homology to these positions. 

In a tenth embodiment of this variant the enzyme is a protease and is capable of 
hydrolysing human Interleukin 8 (hIL-8). The enzymes or the fusion protein can 
thus be used for preparing medicaments for the treatment of diseases, such as, 
but not limited to, Crohn's disease, Ulcerative colitis, classical Hodgkin's 
Lymphoma (cHL), Systemic inflammatory response syndrome (SIRS) which leads 
to distant organ damage and multiple organ dysfunction syndrome (MODS), 
chronic obstructive pulmonary disease (COPD), endometriosis, psoriasis and 
atherosclerotic lesions, as well as other diseases connected with hIL-8. 
Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hIL-8 (SEQ ID NO: 100). More preferably said enzyme or said fusion 
protein is capable of hydrolysing the peptide bonds between positions 11/12, 
15/16, 45/46, 47/48, 52/53, 54/55, 60/61, 64/65 and/or 67/68 (most preferred 
between positions 45/46) in hIL-8, or a peptide bond in proximity to these 
positions in hIL-8, or peptide bonds in protein targets related to hIL-8 at 
positions having structural homology or sequence homology to these positions. 

In an eleventh embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Interleukin-10 (hIL-10). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, classical Hodgkin's Lymphoma (cHL) and 
diseases related to the suppression of cytotoxic T-cells, as well as other diseases 
connected with hIL-10. Preferably, said enzyme or said fusion protein is capable 
of specifically inactivating hIL-10 (SEQ ID NO: 135). More preferably said enzyme 
or said fusion protein is capable of hydrolysing the peptide bonds between 
positions 24/25, 25/26, 27/28, 28/29, 40/41, 44/45, 49/50, 57/58, 59/60, 
84/85, 86/87, 106/107, 107/108, 110/111, 130/131, 134/135, 137/138, 
138/139 and/or 144/145 (most preferred between positions 24/25, 27/28, 
44/45, 49/50, 86/87, 137/138 and/or 144/145) in hIL-10, or a peptide bond in 
proximity to these positions in hIL-10, or peptide bonds in protein targets related 
to hIL-10 at positions having structural homology or sequence homology to these 
positions. 

In a twelfth embodiment of this variant the enzyme is a protease and is capable 
of hydrolysing human Interleukin 12 beta chain (hIL-12B). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, Crohn's disease and classical Hodgkin's 
Lymphoma (cHL), as well as other diseases connected with hIL-12B. Preferably, 
said enzyme or said fusion protein is capable of specifically inactivating hIL-12B 
(SEQ ID NO:97). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 14/15, 18/19, 29/30, 34/35, 
87/88, 99/100, 102/103, 104/105, 161/162, 174/175, 222/223, 225/226, 
228/229, 238/239, 268/269 and/or 293/294 (most preferred between positions 
18/19, 34/35, 87/88 and/or 161/162) in hIL-12B, or a peptide bond in proximity 
to these positions in hIL-12B, or peptide bonds in protein targets related to 
hIL-12B at positions having structural homology or sequence homology to these 
positions. 

In a thirteenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Interleukin 13 (hIL-13). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
cancer, such as, but not limited to, classical Hodgkin's Lymphoma (cHL), 
eosinophilia, asthma, chronic obstructive pulmonary disease, fibrosis, psoriasis, 
atopic dermatitis and Ulcerative colitis, as well as other diseases connected with 
hIL-13. Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hIL-13 (SEQ ID NO:119). More preferably said enzyme or said fusion 
protein is capable of hydrolysing the peptide bonds between positions 25/26, 
62/63, 65/66, 86/87, 87/88, 98/99, 108/109 and/or 111/112 (most preferred 
between positions 87/88) in hIL-13, or a peptide bond in proximity to these 
positions in hIL-13, or peptide bonds in protein targets related to hIL-13 at 
positions having structural homology or sequence homology to these positions. 

In a fourteenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Interleukin 18 (hIL-18). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, Crohn's disease, inflammatory liver injuries, 
pulmonary tuberculosis, pleural tuberculosis and rheumatoid arthritis, as well as 
other diseases connected with hIL-18. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hIL-18 (SEQ ID NO: 98). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 17/18, 32/33, 37/38, 39/40, 40/41, 53/54, 
58/59, 79/80, 90/91, 93/94, 98/99, 110/111, 120/121, 123/124, 131/132, 
132/133, 142/143, 147/148 and/or 157/158 (most preferred between positions 
37/38, 132/133, 142/143 and/or 157/158) in hIL-18, or a peptide bond in 
proximity to these positions in hIL-18, or peptide bonds in protein targets related 
to hIL-18 at positions having structural homology or sequence homology to these 
positions. 

In a fifteenth embodiment of this variant the enzyme is a protease and is capable 
of hydrolysing human Interferon-gamma (hlFN-gamma). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, classical Hodgkin's Lymphoma (cHL), 
Crohn's disease and type I diabetes, as well as other diseases connected with 
hIFN-gamma. Preferably, said enzyme or said fusion protein is capable of
specifically inactivating hIFN-gamma (SEQ ID NO: 137). More preferably said
enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 2/3, 6/7, 13/14, 21/22, 24/25, 34/35, 36/37, 37/38, 62/63, 
68/69, 83/84, 86/87, 90/91, 102/103, 107/108 and/or 108/109 (most preferred 
between positions 13/14, 24/25, 37/38, 62/63, 68/69, 102/103 and/or 107/108)
in hIFN-gamma, or a peptide bond in proximity to these positions in
hIFN-gamma, or peptide bonds in protein targets related to hIFN-gamma at positions
having structural homology or sequence homology to these positions. 

In a sixteenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human small inducible cytokine A2 (hCCL2). The enzymes 
or the fusion protein can thus be used for preparing medicaments for the 
treatment of diseases, such as, but not limited to, Crohn's disease and Ulcerative 
colitis, as well as other diseases connected with hCCL2. Preferably, said enzyme 
or said fusion protein is capable of specifically inactivating hCCL2 (SEQ ID 
NO: 102). More preferably said enzyme or said fusion protein is capable of
hydrolysing the peptide bonds between positions 3/4, 13/14, 18/19, 19/20, 
24/25, 29/30, 38/39, 54/55, 56/57, 58/59, 62/63, 65/66 and/or 68/69 (most 
preferred between positions 19/20, 29/30, 38/39, 54/55 and/or 62/63) in 
hCCL2, or a peptide bond in proximity to these positions in hCCL2, or peptide 
bonds in protein targets related to hCCL2 at positions having structural homology 
or sequence homology to these positions. 

In a seventeenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Eotaxin (hCCL11). The enzymes or the fusion 
protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, Crohn's disease and Ulcerative colitis, 
classical Hodgkin's Lymphoma (cHL), chronic pathophysiologic dysfunction, 
characterized by an influx mainly of Th2 cells, and eosinophilia, as well as other 
diseases connected with hCCL11. Preferably, said enzyme or said fusion protein
is capable of specifically inactivating hCCL11 (SEQ ID NO: 101). More preferably
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 11/12, 16/17, 17/18, 22/23, 27/28, 33/34, 44/45, 47/48,
48/49, 52/53, 54/55, 56/57, 60/61, 66/67 and/or 73/74 (most preferred 
between positions 48/49 and/or 66/67) in hCCL11, or a peptide bond in 
proximity to these positions in hCCL11, or peptide bonds in protein targets
related to hCCL11 at positions having structural homology or sequence homology
to these positions. 

In an eighteenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Vascular endothelial growth factor (hVEGF). The 
enzymes or the fusion protein can thus be used for preparing medicaments for
the treatment of diseases, such as, but not limited to, all solid tumors and 
metastatic solid tumors, inflammatory breast cancer, as well as other diseases 
connected with hVEGF. Preferably, said enzyme or said fusion protein is capable 
of specifically inactivating hVEGF (SEQ ID NO: 103). More preferably said enzyme 
or said fusion protein is capable of hydrolysing the peptide bonds between 
positions 16/17, 19/20, 23/24, 34/35, 41/42, 56/57, 62/63, 63/64, 64/65, 
65/66, 82/83, and/or 84/85 (most preferred between positions 23/24, 41/42, 
63/64, 82/83 and/or 84/85) in hVEGF, or a peptide bond in proximity to these 
positions in hVEGF, or peptide bonds in protein targets related to hVEGF at 
positions having structural homology or sequence homology to these positions. 

In a nineteenth embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human Transforming growth factor beta 1 (hTGF-β1). The
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, a variety of cancers, 
including breast cancer, colorectal cancer and classical Hodgkin's Lymphoma 
(cHL), fibrosis, suppression of cell-mediated immunity, glaucoma, diffuse 
systemic sclerosis, as well as other diseases connected with hTGF-β1. Preferably,
said enzyme or said fusion protein is capable of specifically inactivating hTGF-β1
(SEQ ID NO: 104). More preferably said enzyme or said fusion protein is capable 
of hydrolysing the peptide bonds between positions 23/24, 25/26, 26/27, 27/28, 
37/38, 55/56 and/or 94/95 (most preferred between positions 25/26, 55/56 
and/or 94/95) in hTGF-β1, or a peptide bond in proximity to these positions in
hTGF-β1, or peptide bonds in protein targets related to hTGF-β1 at positions
having structural homology or sequence homology to these positions. 

In a twentieth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Somatotropin (human Growth hormone; hGH). 
The enzymes or the fusion protein can thus be used for preparing medicaments 
for the treatment of diseases, such as, but not limited to, acromegaly, diabetes 
and diabetic kidney disease including renal hypertrophy and glomerular 
enlargement and cardiovascular disorders, as well as other diseases connected 
with hGH. Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hGH (SEQ ID NO:121). More preferably said enzyme or said fusion 
protein is capable of hydrolysing the peptide bonds between positions 8/9, 
16/17, 19/20, 26/27, 33/34, 38/39, 41/42, 70/71, 77/78, 94/95, 103/104,
112/113, 115/116, 116/117, 130/131, 147/148, 154/155 and/or 178/179 (most 
preferred between positions 112/113, 147/148 and/or 154/155) in hGH, or a 
peptide bond in proximity to these positions in hGH, or peptide bonds in protein 
targets related to hGH at positions having structural homology or sequence 
homology to these positions. 

In a twenty-first embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Insulin-like growth factor II (hIGF-II). The
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, diabetes and diabetic 
kidney disease, as well as other diseases connected with hIGF-II. Preferably, said
enzyme or said fusion protein is capable of specifically inactivating hIGF-II (SEQ
ID NO: 122). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 15/16, 23/24, 24/25, 34/35, 
37/38, 38/39, 48/49 and/or 49/50 (most preferred between positions 23/24) in 
hIGF-II, or a peptide bond in proximity to these positions in hIGF-II, or peptide
bonds in protein targets related to hIGF-II at positions having structural
homology or sequence homology to these positions. 

In a twenty-second embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Hepatocyte growth factor (hHGF). The enzymes or 
the fusion protein can thus be used for preparing medicaments for the treatment 
of diseases, such as, but not limited to, angiogenic disorders and hepatocellular 
carcinoma, as well as other diseases connected with hHGF. Preferably, said 
enzyme or said fusion protein is capable of specifically inactivating hHGF (SEQ ID 
NO: 120). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 54/55, 60/61, 62/63, 63/64, 
68/69, 76/77, 112/113, 123/124, 134/135, 168/169, 198/199 and/or 202/203 
(most preferred between positions 63/64, 68/69, 76/77, 168/169 and/or 
202/203) in hHGF, or a peptide bond in proximity to these positions in hHGF, or 
peptide bonds in protein targets related to hHGF at positions having structural 
homology or sequence homology to these positions. 

In a twenty-third embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human insulin (hInsulin). The enzymes or the fusion
protein can thus be used for preparing medicaments for the treatment of
diseases, such as, but not limited to, insulin overdosage, as well as other
diseases connected with hInsulin. Preferably, said enzyme or said fusion protein
is capable of specifically inactivating hInsulin B chain (SEQ ID NO: 105) and/or
hInsulin A chain (SEQ ID NO: 106). More preferably said enzyme or said fusion
protein is capable of hydrolysing the peptide bonds between positions 16/17
and/or 22/23 in hInsulin B and/or between positions 14/15 in hInsulin A, or a
peptide bond in proximity to these positions in hInsulin A or B, or peptide bonds
in protein targets related to hInsulin A or B at positions having structural
homology or sequence homology to these positions. 

In a twenty-fourth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human ghrelin (hGhrelin). The enzymes or the fusion
protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, obesity, as well as other diseases 
connected with hGhrelin. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating hGhrelin (SEQ ID NO: 107). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 1/2, 2/3, 3/4 and/or 4/5 in hGhrelin, or a peptide bond in 
proximity to these positions in hGhrelin, or peptide bonds in protein targets 
related to hGhrelin at positions having structural homology or sequence 
homology to these positions. 

In a twenty-fifth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human angiotensinogen (angiotensin). The enzymes or 
the fusion protein can thus be used for preparing medicaments for the treatment 
of diseases, such as, but not limited to, essential hypertension, as well as other 
diseases connected with angiotensin. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating angiotensin (SEQ ID NO: 108). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 1/2, 3/4 and/or 7/8 (most preferred between 
positions 3/4) in angiotensin, or a peptide bond in proximity to these positions in 
angiotensin, or peptide bonds in protein targets related to angiotensin at 
positions having structural homology or sequence homology to these positions. 

In a twenty-sixth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human leptin precursor (leptin). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, obesity, as well as other diseases
connected with leptin. Preferably, said enzyme or said fusion protein is capable of 
specifically inactivating leptin (SEQ ID NO: 127). More preferably said enzyme or 
said fusion protein is capable of hydrolysing the peptide bonds between positions 
8/9, 9/10, 15/16, 23/24, 40/41, 53/54, 71/72, 85/86, 94/95, 108/109 and/or 
141/142 (most preferred between positions 9/10, 40/41, 71/72, 94/95 and/or 
108/109) in leptin, or a peptide bond in proximity to these positions in leptin, or 
peptide bonds in protein targets related to leptin at positions having structural 
homology or sequence homology to these positions. 

In a twenty-seventh embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing Protective antigen (PA-83). The enzymes or the fusion 
protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, anthrax infection, as well as other diseases 
connected with PA-83. Preferably, said enzyme or said fusion protein is capable 
of specifically inactivating PA-83 (SEQ ID NO: 123). More preferably said enzyme 
or said fusion protein is capable of hydrolysing the peptide bonds between 
positions 72/73, 73/74, 92/93, 93/94, 131/132, 149/150, 178/179, 213/214, 
214/215, 387/388, 425/426, 426/427, 427/428, 453/454, 520/521, 608/609, 
617/618, 671/672, 679/680, 680/681, 683/684 and/or 684/685 (most preferred 
between positions 72/73, 73/74, 93/94, 149/150, 387/388, 425/426, 427/428 
and/or 683/684) in PA-83, or a peptide bond in proximity to these positions in 
PA-83, or peptide bonds in protein targets related to PA-83 at positions having 
structural homology or sequence homology to these positions. 

In a twenty-eighth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human plasminogen (plasminogen). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, thrombosis, as well as other diseases 
connected with plasminogen. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating plasminogen (SEQ ID NO: 140). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bond between positions 580/581 in plasminogen, or a peptide bond in
proximity to this position in plasminogen, or peptide bonds in protein targets 
related to plasminogen at positions having structural homology or sequence 
homology to these positions. 

In a twenty-ninth embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human Prothrombin (thrombin). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, bleeding, as well as other diseases 
connected with thrombin. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating thrombin (SEQ ID NO: 149). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 198/199, 327/328 and/or 363/364 (most preferred between
positions 327/328 and/or 363/364) in thrombin, or a peptide bond in proximity 
to these positions in thrombin, or peptide bonds in protein targets related to 
thrombin at positions having structural homology or sequence homology to these 
positions.

In a thirtieth embodiment of this variant the enzyme is a protease and is capable of
hydrolysing human beta-secretase. The enzymes or the fusion protein can thus 
be used for preparing medicaments for the treatment of diseases, such as, but 
not limited to, Alzheimer's disease, as well as other diseases connected with human
beta-secretase precursor. Preferably, said enzyme or said fusion protein is capable of
specifically inactivating human beta-secretase precursor (SEQ ID NO: 139). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 61/62, 64/65, 159/160, 238/239, 239/240, 
246/247, 256/257, 330/331 and/or 365/366 (most preferred between positions 
61/62, 246/247 and/or 365/366) in human beta-secretase precursor, or a 
peptide bond in proximity to these positions in human beta-secretase precursor, 
or peptide bonds in protein targets related to human beta-secretase precursor at 
positions having structural homology or sequence homology to these positions. 

In a thirty-first embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human matrix metalloproteinase-2 (hMMP-2). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, a variety of cancers 
including bladder cancer, breast tumor cancer, gastric cancer and lung cancer, as 
well as other diseases connected with hMMP-2. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hMMP-2 (SEQ ID NO: 131). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 62/63, 68/69, 75/76, 76/77, 79/80, 88/89, 
110/111, 112/113, 115/116, 120/121, 164/165, 254/255, 267/268, 296/297,
324/325, 325/326, 382/383, 383/384, 470/471, 500/501, 550/551, 564/565, 
595/596, 597/598, 608/609, 646/647, 649/650 and/or 650/651 (most preferred 
between positions 68/69, 115/116, 120/121, 164/165, 325/326, 383/384, 
470/471, 500/501, 595/596, 608/609 and/or 650/651) in hMMP-2, or a peptide 
bond in proximity to these positions in hMMP-2, or peptide bonds in protein 
targets related to hMMP-2 at positions having structural homology or sequence 
homology to these positions. 

In a thirty-second embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human matrix metalloproteinase-9 (hMMP-9). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, a variety of cancers 
including bladder cancer, breast tumor cancer, gastric cancer and lung cancer, as 
well as other diseases connected with hMMP-9. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hMMP-9 (SEQ ID NO: 132). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 41/42, 42/43, 106/107, 113/114, 134/135, 
160/161, 162/163, 163/164, 222/223, 226/227, 265/266, 266/267, 267/268, 
284/285, 309/310, 321/322, 322/323, 324/325, 356/357, 380/381, 433/434 
and/or 440/441 (most preferred between positions 160/161, 163/164, 226/227, 
284/285, 321/322, 322/323 and/or 433/434) in hMMP-9, or a peptide bond in 
proximity to these positions in hMMP-9, or peptide bonds in protein targets 
related to hMMP-9 at positions having structural homology or sequence 
homology to these positions. 

In a thirty-third embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing HIV membrane glycoprotein (GP120). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, AIDS or HIV infection, as well as other 
diseases connected with GP120 or HIV infection. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating GP120 (SEQ ID NO: 124). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 97/98, 99/100, 107/108, 113/114, 117/118,
227/228, 231/233, 279/280, 335/336, 337/338, 368/369, 412/413, 419/420, 
429/430, 444/445, 457/458, 474/475, 476/477, 477/478, 485/486 and/or 
490/491 (most preferred between positions 99/100, 368/369, 412/413,
419/420, 444/445 and/or 490/491) in GP120, or a peptide bond in proximity to 
these positions in GP120, or peptide bonds in protein targets related to GP120 at 
positions having structural homology or sequence homology to these positions. 

In a thirty-fourth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Cytotoxic T-lymphocyte protein 4 (hCTLA-4). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, breast cancer, as well as 
other diseases connected with hCTLA-4. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hCTLA-4 (SEQ ID NO: 144). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 14/15, 28/29, 33/34, 38/39, 41/42, 62/63, 
72/73, 85/86, 95/96, 100/101, 105/106, 119/120, 125/126 and/or 127/128
(most preferred between positions 14/15, 28/29, 38/39, 41/42, 62/63 and/or 
85/86) in hCTLA-4, or a peptide bond in proximity to these positions in hCTLA-4, 
or peptide bonds in protein targets related to hCTLA-4 at positions having 
structural homology or sequence homology to these positions. 

In a thirty-fifth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Integrin alpha-2 (hVLA-2). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, renal tumors, uveal melanomas and 
gastrointestinal tumors, as well as other diseases connected with hVLA-2. 
Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hVLA-2 (SEQ ID NO: 147). More preferably said enzyme or said 
fusion protein is capable of hydrolysing the peptide bonds between positions 
160/161, 174/175, 201/202, 219/220, 231/232, 232/233, 233/234, 243/244, 
259/260, 264/265, 268/269, 288/289, 292/293, 294/295, 298/299, 301/302, 
310/311 and/or 317/318 (most preferred between positions 160/161, 174/175, 
201/202, 219/220, 243/244, 264/265, 292/293 and/or 294/295) in hVLA-2, or a 
peptide bond in proximity to these positions in hVLA-2, or peptide bonds in 
protein targets related to hVLA-2 at positions having structural homology or 
sequence homology to these positions. 

In a thirty-sixth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Vascular endothelial growth factor receptor 1 
(hVEGFR 1). The enzymes or the fusion protein can thus be used for preparing
medicaments for the treatment of diseases, such as, but not limited to, solid 
tumors and metastatic solid tumors, astrocytic brain tumors, pancreatic cancer
and metastatic renal cancer, as well as other diseases
connected with hVEGFR 1. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating hVEGFR 1 (SEQ ID NO: 114). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 189/190, 190/191, 224/225 and/or 331/332 (most preferred 
between positions 189/190 and/or 331/332) in hVEGFR 1, or a peptide bond in
proximity to these positions in hVEGFR 1, or peptide bonds in protein targets 
related to hVEGFR 1 at positions having structural homology or sequence 
homology to these positions. 

In a thirty-seventh embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human Vascular endothelial growth factor receptor 2 
(hVEGFR 2). The enzymes or the fusion protein can thus be used for preparing 
medicaments for the treatment of diseases, such as, but not limited to, solid 
tumors and metastatic solid tumors, pancreatic cancer, metastatic renal cancer,
metastatic CRC, as well as other diseases connected with hVEGFR 2. Preferably, 
said enzyme or said fusion protein is capable of specifically inactivating hVEGFR 
2 (SEQ ID NO: 115). More preferably said enzyme or said fusion protein is
capable of hydrolysing the peptide bonds between positions 214/215 and/or
323/324 (most preferred between positions 214/215) in hVEGFR 2, or a peptide
bond in proximity to these positions in hVEGFR 2, or peptide bonds in protein 
targets related to hVEGFR 2 at positions having structural homology or sequence 
homology to these positions. 

In a thirty-eighth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Epidermal growth factor receptor (hEGFr). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, bladder cancer, breast
cancer, cervical cancer, colorectal cancer, endometrial cancer, oesophageal 
cancer, head and neck cancer, gastric cancer, non-small-cell lung carcinoma and
ovarian cancer, as well as other diseases connected with hEGFr. Preferably, said 
enzyme or said fusion protein is capable of specifically inactivating hEGFr (SEQ 
ID NO: 116). More preferably said enzyme or said fusion protein is capable of
hydrolysing the peptide bonds between positions 20/21, 29/30, 48/49, 74/75,
165/166, 202/203, 220/221, 246/247, 251/252, 269/270, 270/271, 304/305, 
305/306, 357/358, 430/431, 443/444, 454/455, 455/456, 463/464, 465/466, 
476/477, 507/508 and/or 509/510 in hEGFr, or a peptide bond in proximity to 
these positions in hEGFr, or peptide bonds in protein targets related to hEGFr at 
positions having structural homology or sequence homology to these positions. 

In a thirty-ninth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Epithelial cell adhesion molecule (hEp-CAM). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, colorectal cancer, as well 
as other diseases connected with hEp-CAM. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hEp-CAM (SEQ ID NO: 125). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 14/15, 19/20, 25/26, 30/31, 33/34, 55/56 
and/or 70/71 (most preferred between positions 14/15, 30/31 and/or 70/71) in
hEp-CAM, or a peptide bond in proximity to these positions in hEp-CAM, or 
peptide bonds in protein targets related to hEp-CAM at positions having 
structural homology or sequence homology to these positions. 

In a fortieth embodiment of this variant the enzyme is a protease and is capable of
hydrolysing human insulin-like growth factor I receptor (hIGF-1r). The enzymes
or the fusion protein can thus be used for preparing medicaments for the 
treatment of diseases, such as, but not limited to, a variety of cancers including 
breast cancer, as well as other diseases connected with hIGF-1r. Preferably, said
enzyme or said fusion protein is capable of specifically inactivating hIGF-1r (SEQ
ID NO: 126). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 59/60, 115/116, 146/147, 
171/172, 191/192, 290/291, 306/307, 307/308, 335/336, 336/337, 455/456 
and/or 470/471 (most preferred between positions 306/307, 307/308, 335/336 
and/or 470/471) in hIGF-1r, or a peptide bond in proximity to these positions in
hIGF-1r, or peptide bonds in protein targets related to hIGF-1r at positions
having structural homology or sequence homology to these positions. 

In a forty-first embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human T-cell surface antigen CD2 precursor (hCD2). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, psoriasis, as well as other
diseases connected with hCD2. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating hCD2 (SEQ ID NO: 128). More preferably said 
enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 42/43, 43/44, 48/49, 49/50, 51/52, 54/55, 63/64, 69/70, 
89/90 and/or 91/92 (most preferred between positions 43/44, 51/52 and/or 
89/90) in hCD2, or a peptide bond in proximity to these positions in hCD2, or 
peptide bonds in protein targets related to hCD2 at positions having structural 
homology or sequence homology to these positions. 

In a forty-second embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human T-cell surface glycoprotein CD4 (hCD4). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, psoriasis, transplant 
rejection, graft-versus-host colitis, autoimmune disorders and rheumatoid 
arthritis, as well as other diseases connected with hCD4. Preferably, said enzyme 
or said fusion protein is capable of specifically inactivating hCD4 (SEQ ID 
NO: 129). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 166/167, 167/168, 206/207, 
219/220, 224/225, 226/227, 251/252, 252/253, 322/323, 329/330 and/or 
334/335 (most preferred between positions 206/207, 219/220, 251/252 and/or 
252/253) in hCD4, or a peptide bond in proximity to these positions in hCD4, or
peptide bonds in protein targets related to hCD4 at positions having structural 
homology or sequence homology to these positions. 

In a forty-third embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Integrin alpha-L (hCD11a). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, psoriasis as well as other diseases 
connected with hCD11a. Preferably, said enzyme or said fusion protein is capable
of specifically inactivating hCD11a (SEQ ID NO: 130). More preferably said 
enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 145/146, 152/153, 156/157, 159/160, 160/161, 177/178, 
178/179, 189/190, 190/191, 191/192, 193/194, 197/198, 200/201, 221/222, 
229/230, 249/250, 253/254, 268/269, 290/291, 297/298, 304/305 and/or 
305/306 (most preferred between positions 145/146, 159/160, 160/161, 
189/190, 229/230, 249/250, 268/269, 297/298, 304/305 and/or 305/306) in
hCD11a, or a peptide bond in proximity to these positions in hCD11a, or peptide 
bonds in protein targets related to hCD11a at positions having structural 
homology or sequence homology to these positions. 

In a forty-fourth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Interferon-gamma receptor alpha chain
(hIFN-gamma-R1). The enzymes or the fusion protein can thus be used for preparing
medicaments for the treatment of diseases, such as, but not limited to, classical 
Hodgkin's Lymphoma (cHL) and type I diabetes, as well as other diseases 
connected with hIFN-gamma-R1. Preferably, said enzyme or said fusion protein
is capable of specifically inactivating hIFN-gamma-R1 (SEQ ID NO: 136). More
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 49/50, 52/53, 62/63, 106/107, 122/123, 
174/175, 215/216 and/or 222/223 (most preferred between positions 49/50, 
122/123, 174/175 and/or 215/216) in hIFN-gamma-R1, or a peptide bond in
proximity to these positions in hIFN-gamma-R1, or peptide bonds in protein
targets related to hIFN-gamma-R1 at positions having structural homology or 
sequence homology to these positions. 

In a forty-fifth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Platelet membrane glycoprotein IIb/IIIa
(hGPIIb/IIIa). The enzymes or the fusion protein can thus be used for preparing
medicaments for the treatment of diseases, such as, but not limited to, unstable 
angina, carotid stenting, ischemic stroke, peripheral vascular diseases, 
angiogenesis-related diseases and disseminating tumors, as well as other
diseases connected with hGPIIb/IIIa. Preferably, said enzyme or said fusion
protein is capable of specifically inactivating hGPIIb/IIIa (SEQ ID NO: 141). More
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 67/68, 91/92, 129/130, 143/144, 144/145, 
181/182, 208/209, 209/210, 216/217, 239/240, 261/262, 410/411, 532/533, 
556/557, 557/558, 597/598, 650/651 and/or 689/690 (most preferred between 
positions 67/68, 261/262, 410/411, 650/651 and/or 689/690) in hGPIIb/IIIa, or
a peptide bond in proximity to these positions in hGPIIb/IIIa, or peptide bonds in
protein targets related to hGPIIb/IIIa at positions having structural homology or
sequence homology to these positions. 

In a forty-sixth embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human Intercellular adhesion molecule-1 (hICAM-1). The
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, Crohn's disease, as well as 
other diseases connected with hlCAM-1. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hlCAM-1 (SEQ ID NO: 142). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 40/41, 88/89, 97/98, 102/103, 128/129, 
131/132, 132/133, 149/150, 150/151, 160/161 and/or 166/167 (most preferred 
between positions 88/89, 102/103, 150/151, 160/161 and/or 166/167) in 
hlCAM-1, or a peptide bond in proximity to these positions in hlCAM-1, or 
peptide bonds in protein targets related to hi CAM- 1 at positions having structural 
homology or sequence homology to these positions. 

In a forty-seventh embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human TGF-beta receptor type II (hTGF-beta RII). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, diffuse systemic sclerosis, 
as well as other diseases connected with hTGF-beta RII. Preferably, said enzyme 
or said fusion protein is capable of specifically inactivating hTGF-beta RII (SEQ ID 
NO: 145). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 32/33, 34/35, 35/36, 66/67, 
67/68, 69/70, 82/83, 103/104, 104/105, 105/106, 118/119, 122/123 and/or 
130/131 (most preferred between positions 32/33, 34/35, 66/67, 69/70, 
104/105, 122/123 and/or 130/131) in hTGF-beta RII, or a peptide bond in 
proximity to these positions in hTGF-beta RII, or peptide bonds in protein targets 
related to hTGF-beta RII at positions having structural homology or sequence 
homology to these positions. 

In a forty-eighth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Membrane cofactor protein (hMCP). The enzymes 
or the fusion protein can thus be used for preparing medicaments for the 
treatment of diseases, such as, but not limited to, renal tumors, uveal 
melanomas and gastrointestinal tumors, as well as other diseases connected with 
hMCP. Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hMCP (SEQ ID NO: 146). More preferably said enzyme or said fusion 
protein is capable of hydrolysing the peptide bonds between positions 15/16, 
17/18, 25/26, 31/32, 32/33, 35/36, 48/49, 67/68, 69/70, 110/111, 119/120 
and/or 125/126 (most preferred between positions 15/16, 32/33, 48/49, 
119/120 and/or 125/126) in hMCP, or a peptide bond in proximity to 
these positions in hMCP, or peptide bonds in protein targets related to hMCP at 
positions having structural homology or sequence homology to these positions. 

In a forty-ninth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Protease activated receptor 1 (hPAR1). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, thrombosis, as well as 
other diseases connected with hPAR1. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hPAR1 (SEQ ID NO: 110). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 46/47, 51/52 and/or 52/53 in hPAR1, or a 
peptide bond in proximity to these positions in hPAR1, or peptide bonds in 
protein targets related to hPAR1 at positions having structural homology or 
sequence homology to these positions. 

In a fiftieth embodiment of this variant the enzyme is a protease and is capable of 
hydrolysing human Protease activated receptor 2 (hPAR2). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, Crohn's disease, Ulcerative colitis and 
Inflammatory bowel disease, asthma, inflammation associated pain and arthritis, 
as well as other diseases connected with hPAR2. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hPAR2 (SEQ ID NO: 111). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 41/42, 51/52 and/or 59/60 in hPAR2, or a 
peptide bond in proximity to these positions in hPAR2, or peptide bonds in 
protein targets related to hPAR2 at positions having structural homology or 
sequence homology to these positions. 

In a fifty-first embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Protease activated receptor 4 (hPAR4). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, thrombosis, as well as 
other diseases connected with hPAR4. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hPAR4 (SEQ ID NO: 113). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 68/69, 74/75 and/or 78/79 in hPAR4, or a 
peptide bond in proximity to these positions in hPAR4, or peptide bonds in 
protein targets related to hPAR4 at positions having structural homology or 
sequence homology to these positions. 

In a fifty-second embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human 5-hydroxytryptamine 1A receptor (h5-HT-1A). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, irritable bowel syndrome, 
as well as other diseases connected with h5-HT-1A. Preferably, said enzyme or 
said fusion protein is capable of specifically inactivating h5-HT-1A (SEQ ID 
NO: 117). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 101/102, 102/103, 181/182 
and/or 370/371 in h5-HT-1A, or a peptide bond in proximity to these positions in 
h5-HT-1A, or peptide bonds in protein targets related to h5-HT-1A at positions 
having structural homology or sequence homology to these positions. 

In a fifty-third embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human carcinoembryonic antigen (hCEA). The enzymes or 
the fusion protein can thus be used for preparing medicaments for the treatment 
of diseases, such as, but not limited to, colon cancer, as well as other diseases 
connected with hCEA. Preferably, said enzyme or said fusion protein is capable of 
specifically inactivating hCEA (SEQ ID NO: 138). More preferably said enzyme or 
said fusion protein is capable of hydrolysing the peptide bonds between positions 
17/18, 69/70, 71/72, 74/75, 77/78, 98/99, 116/117, 126/127 and/or 128/129 in 
hCEA, or a peptide bond in proximity to these positions in hCEA, or peptide 
bonds in protein targets related to hCEA at positions having structural homology 
or sequence homology to these positions. 

It is obvious to someone skilled in the art that polymorphisms of all target 
sequences referred to are also included. The expression "proximity to these 
positions" in all embodiments above refers to positions of peptide bonds that are 
between 10 and 5 Angstrom and/or 5 amino acids, preferably 3 amino acids, next 
to the positions of the peptide bonds. 

Preferably, in this variant the scaffold of the engineered enzyme provided in step 
(c) is of human origin in order to avoid or reduce immunogenicity or allergenic 
effects associated with the application of the enzyme in the human body. 
Alternatively, immunogenicity and allergenicity can be reduced by deimmunization 
of the engineered enzyme. 
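
The sequence-based part of the expression "proximity to these positions" defined above can be sketched as follows. The sketch assumes that "5 amino acids, preferably 3 amino acids, next to" a bond means any peptide bond shifted by at most that number of residues along the chain; this reading, the function name `bonds_in_proximity` and the sequence length used in the example are assumptions. The distance criterion given in Angstrom would require a three-dimensional structure and is not covered.

```python
def bonds_in_proximity(bond_start: int, seq_len: int, window: int = 5) -> list[tuple[int, int]]:
    """Return the peptide bonds (i, i+1) lying within `window` residues of the
    bond (bond_start, bond_start + 1), excluding that bond itself.

    Residues are numbered from 1; the last possible bond is (seq_len - 1, seq_len).
    """
    if not 1 <= bond_start < seq_len:
        raise ValueError("bond_start must denote a bond inside the sequence")
    lo = max(1, bond_start - window)
    hi = min(seq_len - 1, bond_start + window)
    return [(i, i + 1) for i in range(lo, hi + 1) if i != bond_start]


# Bonds within 3 residues of bond 49/50 in a hypothetical 245-residue protein.
print(bonds_in_proximity(49, seq_len=245, window=3))
```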

In a further embodiment of this variant, the target substrate is a pro-drug which is 
activated by the engineered enzyme. In a particular embodiment of this variant, 
the engineered enzyme has proteolytic activity and the target substrate is a 
protein target which is proteolytically activated. Examples of such pro-drugs are 
pro-proteins such as the inactivated forms of coagulation factors. In another 
particular variant, the engineered enzyme is an oxidoreductase and the target 
substrate is a chemical that can be activated by oxidation. 

In a second variant of this aspect of the invention, the engineered enzyme is 
used for diagnostic purposes. In a particular embodiment of this variant, the 
engineered enzyme is a target-specific protease. Such diagnostic purposes 
comprise but are not limited to applications with the aim of diagnosing diseases, 
testing genetic predispositions or monitoring disease progression during therapy. 
In a particular embodiment, the diagnosis is based on the testing for the 
presence or absence of a disease-specific marker protein or a disease-specific 
variant of a human protein in test samples such as human tissue samples, blood 
samples or other samples taken from patients. The testing employs a protease 
with specificity for a particular, disease-related target protein. The testing is done 
by analysing the proteolytic degradation of such protein in the test sample. 
In a preferred embodiment the aim of the diagnostic test is to detect and/or 
quantify a disease-specific variant of a native human protein. Such a diagnostic 
test employs a protease that is specific for the disease-related protein variant, 
i.e. it has significantly higher proteolytic activity on the disease-related protein 
variant compared to the native human protein. The disease-related protein 
variant is therefore detected and/or quantified by detecting and/or quantifying 
the activity of the target-specific protease. Such detection and/or quantification 
is done by directly measuring the degradation products of the target protein or 
indirectly by measuring the influence of the target protein on the activity of the 
target-specific protease by a competition assay. In another preferred 
embodiment the aim of the diagnostic test is to detect and/or quantify a protein 
that is specific for an infection by an infectious agent such as a virus or a 
bacterium. Such a diagnostic test employs a protease that is specific for a protein 
specifically expressed upon infection by the infectious agent, i.e. it has 
significantly higher proteolytic activity on a particular infection-indicating protein 
compared to any other native human protein. The infection-indicating protein is 
therefore detected and/or quantified by detecting and/or quantifying the 
proteolytic activity of the target-specific protease. Such detection and/or 
quantification is done by directly measuring the degradation products of the 
infection-indicating protein or indirectly by measuring the influence of the 
infection-indicating protein on the activity of the target-specific protease by a 
competition assay. 
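
As a purely illustrative sketch, the "significantly higher proteolytic activity" on one protein compared to another can be expressed as a ratio of specificity constants (kcat/KM). The choice of this measure, the function names and the numerical values below are assumptions made for this example; they are not measured data and are not prescribed by the present description.

```python
def specificity_constant(kcat: float, km: float) -> float:
    """Return kcat/KM for one enzyme-substrate pair (units as supplied by the caller)."""
    if kcat < 0 or km <= 0:
        raise ValueError("kcat must be non-negative and KM must be positive")
    return kcat / km


def selectivity(target: tuple[float, float], other: tuple[float, float]) -> float:
    """Ratio of the specificity constants for the target and for another substrate.

    Each argument is a (kcat, KM) pair; a ratio well above 1 indicates a
    preference for the target substrate.
    """
    other_constant = specificity_constant(*other)
    if other_constant == 0:
        raise ValueError("the comparison substrate shows no turnover")
    return specificity_constant(*target) / other_constant


# Hypothetical values: disease-related variant versus native protein.
ratio = selectivity(target=(8.0, 2.0), other=(1.0, 16.0))
print(f"selectivity for the variant over the native protein: {ratio:.0f}-fold")
```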

In a third variant of this aspect of the invention, the engineered enzyme is used 
as a technical means in order to catalyze an industrially or nutritionally relevant 
reaction with defined specificity. In a particular embodiment of this variant the 
engineered enzyme has proteolytic activity, the catalyzed reaction is a proteolytic 
processing, and the engineered enzyme specifically hydrolyses one or more 
industrially or nutritionally relevant protein substrates. In a preferred embodiment 
of this variant the engineered enzyme hydrolyses one or more industrially or 
nutritionally relevant protein substrates at specific sites, thereby leading to 
industrially or nutritionally desired product properties such as texture, taste or 
precipitation characteristics. In a further particular embodiment of this variant, 
the engineered enzyme catalyzes the hydrolysis of glycosidic bonds (glycosidase 
or glycosylase activity). Then, preferably, the catalyzed reaction is a 
polysaccharide processing, and the engineered enzyme specifically hydrolyses 
one or more industrially, technically or nutritionally relevant polysaccharide 
substrates. In a further particular embodiment of this variant, the engineered 
enzyme catalyzes the hydrolysis of triglyceride esters or lipids (lipase activity). 
Then, preferably, the catalyzed reaction is a lipid processing step, and the 
engineered enzyme specifically hydrolyses one or more industrially, technically or 
nutritionally relevant lipid substrates. In a further particular variant of this 
embodiment, the engineered enzyme catalyzes the oxidation or reduction of 
substrates (oxidoreductase activity). Then, preferably, the engineered enzyme 
specifically oxidizes or reduces one or more industrially, technically or nutritionally 
relevant chemical substrates. 

A second aspect of the invention discloses engineered enzymes with defined 
specificities. These engineered enzymes are characterized by the following 
components: 

(a) a protein scaffold capable of catalyzing at least one chemical reaction on a 
substrate, and 

(b) one or more specificity determining regions (SDRs) located at sites in the 
protein scaffold that enable the resulting engineered protein to discriminate 
between at least one target substrate and one or more different substrates, 
wherein the SDRs are essentially synthetic peptide sequences. 

Preferably, such defined specificity of the engineered enzymes is not conferred 
by the protein scaffold. 

In principle, the protein scaffold can have a variety of primary, secondary and 
tertiary structures. The primary structure, i.e. the amino acid sequence, can be 
an engineered sequence or can be derived from any viral, prokaryotic or 
eukaryotic origin. For human therapeutic use, however, the protein scaffold is 
preferably of mammalian origin, and more preferably, of human origin. 
Furthermore, the protein scaffold is capable of catalyzing one or more chemical 
reactions and preferably has only a low specificity. 

Preferably, derivatives of the protein scaffold are used that have modified amino 
acid sequences that confer improved characteristics for the applicability as 
protein scaffolds. Such improved characteristics comprise, but are not limited to, 
stability; expression or secretion yield; folding, in particular after combination of 
the protein scaffold with SDRs; increased or decreased sensitivity to regulators 
such as activators or inhibitors; immunogenicity; catalytic rate; KM or substrate 
affinity. 

The engineered enzymes derive their quantitative specificity from the synthetic 
peptide sequences that are combined with the protein scaffold. Therefore, the 
engineered peptide sequences are acting as Specificity Determining Regions or 
SDRs. The number, the length and the positions of such SDRs can vary over a 
wide range. The number of SDRs within the scaffold is at least one, preferably 
more than one, more preferably between two and eleven, most preferably 
between two and six. The SDRs have a length between one and 50 amino acid 
residues, preferably a length between one and 15 amino acid residues, more 
preferably a length between one and six amino acid residues. Alternatively, the 
SDRs have a length between two and 20 amino acid residues, preferably a length 
between two and ten amino acid residues, more preferably a length between 
three and eight amino acid residues. 
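
The first set of ranges stated above for the number and the length of SDRs can be summarised in a small lookup, shown here as a non-limiting sketch. Treating all bounds as inclusive is an assumption, as are the names `COUNT_TIERS`, `LENGTH_TIERS`, `tier` and `classify_sdrs` and the example peptide sequences. The alternative length ranges (two to 20, two to ten, three to eight) are not encoded.

```python
# (label, lower bound, upper bound); None means "no upper bound".
# Tiers are ordered from the narrowest to the widest range.
COUNT_TIERS = [
    ("most preferred", 2, 6),
    ("more preferred", 2, 11),
    ("preferred", 2, None),
    ("allowed", 1, None),
]
LENGTH_TIERS = [
    ("more preferred", 1, 6),
    ("preferred", 1, 15),
    ("allowed", 1, 50),
]


def tier(value: int, tiers: list) -> str:
    """Return the label of the narrowest tier that contains `value`."""
    for label, lo, hi in tiers:
        if value >= lo and (hi is None or value <= hi):
            return label
    return "outside stated range"


def classify_sdrs(sdrs: list[str]) -> dict:
    """Classify the number of SDRs and the length of each SDR."""
    return {
        "count": tier(len(sdrs), COUNT_TIERS),
        "lengths": [tier(len(s), LENGTH_TIERS) for s in sdrs],
    }


# Two hypothetical SDR sequences of six and five residues.
print(classify_sdrs(["GSTAVL", "PQRST"]))
```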

The inventive engineered enzymes can further be described as antibody-like 
protein molecules comprising constant and variable regions, but having a 
non-immunoglobulin backbone and having an active site (catalytic activity) in the 
constant region, whereby the substrate specificity of the active site is modulated 
by the variable region. Preferably, as in the immunoglobulin structure, the 
variable regions are loops of variable length and composition that interact with a 
target molecule. 

In a particular variant of the invention, the engineered enzymes have hydrolase 
activity. In a preferred variant, the engineered enzymes have proteolytic activity. 
Particularly preferred protein scaffolds for this variant are unspecific proteases or 
are parts from unspecific proteases or are otherwise derived from unspecific 
proteases. The expressions "derived from" or "a derivative thereof" in this 
respect and in the following variants and embodiments refer to derivatives of 
proteins that are mutated at one or more amino acid positions and/or have a 
homology of at least 70%, preferably 90%, more preferably 95% and most 
preferably 99% to the original protein, and/or that are proteolytically processed, 
and/or that have an altered glycosylation pattern, and/or that are covalently 
linked to non-protein substances, and/or that are fused with further protein 
domains, and/or that have C-terminal and/or N-terminal truncations, and/or that 
have specific insertions, substitutions and/or deletions. Alternatively, "derived 
from" may refer to derivatives that are combinations or chimeras of two or more 
fragments from two or more proteins, each of which optionally comprises any or 
all of the aforementioned modifications. The tertiary structure of the protein 
scaffold can be of any type. Preferably, however, the tertiary structure belongs to 
one of the following structural classes: class S1 (chymotrypsin fold of the serine 
proteases family), class S8 (subtilisin fold of the serine proteases family), class 
SC (carboxypeptidase fold of the serine proteases family), class A1 (pepsin A fold 
of the aspartic proteases), or class C14 (caspase-1 fold of the cysteine 
proteases). Examples of proteases that can serve as the protein scaffold of 
engineered proteolytic enzymes for the use as human therapeutics are or are 
derived from human trypsin, human thrombin, human chymotrypsin, human 
pepsin, human endothiapepsin, human caspases 1 to 14, and/or human furin. 
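
The homology thresholds named above for derivatives (at least 70%, preferably 90%, more preferably 95%, most preferably 99%) can be checked with a simple percent-identity calculation, sketched below. Using identity over the aligned, gap-free columns of a pre-computed pairwise alignment as the measure of homology is an assumption of this example, as are the function names and the toy sequences.

```python
THRESHOLDS = [99.0, 95.0, 90.0, 70.0]  # percent, from most to least stringent


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment that contain no gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not columns:
        return 0.0
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity: float):
    """Return the most stringent threshold satisfied by `identity`, or None if below 70%."""
    for threshold in THRESHOLDS:
        if identity >= threshold:
            return threshold
    return None


# Toy alignment: nine identical residues out of ten aligned positions.
identity = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
print(identity, highest_threshold_met(identity))
```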

The defined specificity of the engineered proteolytic enzymes is a measure of 
their ability to discriminate between at least one target peptide or protein 
substrate and one or more further peptide or protein substrates. Preferably, the 
defined specificity refers to the ability to discriminate peptide or protein 
substrates that differ in other positions than the P1 site, more preferably, the 
defined specificity refers to the ability to discriminate peptide or protein 
substrates that differ in other positions than the P1 site and the P1' site. Most 
preferably, the engineered proteolytic enzymes distinguish target peptide or 
protein substrates at as many sites as is necessary to preferentially hydrolyse 
the target substrate versus other proteins. As an example, a therapeutically 
useful engineered proteolytic enzyme applied intravenously in the human body 
should be sufficiently specific to discriminate between the target substrate and 
any other protein in the human serum. Preferably, such an engineered 
proteolytic enzyme recognizes and discriminates peptide substrates at three or 
more amino acid positions, more preferably at four or more positions, and even 
more preferably at five or more amino acid positions. These positions may either 
be adjacent or non-adjacent. 
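
The discrimination at positions other than the P1 site and the P1' site can be illustrated with the following sketch, which lists the subsites at which two substrate peptides differ. It assumes the customary numbering in which the scissile bond lies between P1 and P1', with P2, P3, ... towards the N-terminus and P2', P3', ... towards the C-terminus. The function names and the two octapeptides are hypothetical.

```python
def site_labels(n_nonprimed: int, n_primed: int) -> list[str]:
    """Labels from the N-terminal side to the C-terminal side, e.g. P4..P1, P1'..P4'."""
    nonprimed = [f"P{i}" for i in range(n_nonprimed, 0, -1)]
    primed = [f"P{i}'" for i in range(1, n_primed + 1)]
    return nonprimed + primed


def discriminating_sites(target: str, other: str, n_nonprimed: int,
                         ignore: tuple = ("P1", "P1'")) -> list[str]:
    """Subsites at which `target` and `other` differ, leaving out the sites in `ignore`.

    Both peptides must have the same length and the same scissile bond,
    located after the first `n_nonprimed` residues.
    """
    if len(target) != len(other):
        raise ValueError("peptides must have equal length")
    labels = site_labels(n_nonprimed, len(target) - n_nonprimed)
    return [label for label, t, o in zip(labels, target, other)
            if t != o and label not in ignore]


# Hypothetical octapeptides spanning P4 to P4'.
sites = discriminating_sites("AVLRSGAT", "GVLRSGPT", n_nonprimed=4)
print(sites, len(sites))
```

A protease that prefers the first peptide over the second in this toy example would have to recognise at least one of the listed subsites.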

In a first embodiment, the protein scaffold has a tertiary structure or fold equal 
or similar to the tertiary structure or fold of the S1 structural subclass of serine 
proteases, i.e. the chymotrypsin fold, and/or has at least 70% identity on the 
amino acid level to a protein of the S1 structural subclass of serine proteases. It 
is preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 18-25, 38-48, 54-63, 73-86, 122-130, 
148-156, 165-171 and 194-204 in human trypsin I, and more preferably at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 20-23, 41-45, 57-60, 76-83, 
125-128, 150-153, 167-169 and 197-201 (numbering of amino acids according to 
SEQ ID NO:1). The number of SDRs to be combined with this type of protein 
scaffold is preferably between 1 and 10, and more preferably between 2 and 4. 
Preferably, the protein scaffold is equal to or is a derivative or homologue of one 
or more of the following proteins: chymotrypsin, granzyme, kallikrein, trypsin, 
mesotrypsin, neutrophil elastase, pancreatic elastase, enteropeptidase, 
cathepsin, thrombin, ancrod, coagulation factor IXa, coagulation factor VIIa, 
coagulation factor Xa, activated protein C, urokinase, tissue-type plasminogen 
activator, plasmin, Desmodus-type plasminogen activator. More preferably, the 
protein scaffold is trypsin or thrombin or is a derivative or homologue from 
trypsin or thrombin. For the use as a human therapeutic, the trypsin or thrombin 
scaffold is most preferably of human origin in order to minimize the risk of an 
immune response or an allergenic reaction. 
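
Whether a given residue position falls into one of the regions listed above for human trypsin I can be looked up as sketched below. The region boundaries are those stated in this embodiment; treating them as inclusive, the names `TRYPSIN_REGIONS`, `TRYPSIN_PREFERRED_REGIONS` and `region_containing`, and the choice of example positions are assumptions of the sketch.

```python
# Regions in human trypsin I (numbering according to SEQ ID NO:1), as listed above.
TRYPSIN_REGIONS = [(18, 25), (38, 48), (54, 63), (73, 86),
                   (122, 130), (148, 156), (165, 171), (194, 204)]
TRYPSIN_PREFERRED_REGIONS = [(20, 23), (41, 45), (57, 60), (76, 83),
                             (125, 128), (150, 153), (167, 169), (197, 201)]


def region_containing(position: int, regions: list):
    """Return the (first, last) region that contains `position`, or None."""
    for first, last in regions:
        if first <= position <= last:
            return (first, last)
    return None


# Example positions 42 and 123.
for pos in (42, 123):
    print(pos,
          region_containing(pos, TRYPSIN_REGIONS),
          region_containing(pos, TRYPSIN_PREFERRED_REGIONS))
```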

Preferably, derivatives with improved characteristics derived from human trypsin 
I or from proteins with similar tertiary structure are used. Preferred examples of 
such derivatives are derived from human trypsin I (SEQ ID NO:1) and comprise 
one or more of the following amino acid substitutions E56G; R78W; Y131F; 
A146T; C183R. 
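
Substitutions written in the form E56G (original residue, position, new residue) can be applied to a sequence as in the following sketch. The function name `apply_substitutions` and the five-residue toy sequence with its toy substitutions are assumptions; applying the substitutions listed above would require the sequence of SEQ ID NO:1, which is not reproduced here.

```python
import re


def apply_substitutions(sequence: str, substitutions: list[str]) -> str:
    """Apply point substitutions such as 'E56G' (1-based positions) to `sequence`.

    Raises ValueError if a substitution is malformed, out of range, or if the
    residue found at the position does not match the stated original residue.
    """
    residues = list(sequence)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"malformed substitution: {sub!r}")
        original, position, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(residues):
            raise ValueError(f"position out of range in {sub!r}")
        if residues[position - 1] != original:
            raise ValueError(
                f"{sub!r}: found {residues[position - 1]} at position {position}")
        residues[position - 1] = new
    return "".join(residues)


# Toy example only: E at position 3 becomes G, R at position 5 becomes W.
print(apply_substitutions("MKEAR", ["E3G", "R5W"]))
```

Checking the original residue before replacing it guards against numbering offsets between different versions of a scaffold sequence.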

It is preferred that at least one of two SDRs is inserted into human trypsin I, or 
a derivative thereof, between residues 42 and 43 (SDR 1) and between residues 
123 and 124 (SDR 2), respectively (numbering of amino acids according to SEQ ID 
NO:1). In addition, SDR 1 has a preferred length of 6 amino acids and SDR 2 has 
a preferred length of 5 amino acids. In a preferred variant of this embodiment, 
the SDR 1 and SDR 2 sequences comprise one of the amino acid sequences listed 
in table 2. Such engineered proteolytic enzymes have specificity for the target 
substrate B as exemplified in example IV. 
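
The insertion of SDRs between defined residues of a scaffold can be sketched as follows. The function name `insert_sdrs`, the placeholder scaffold (a string of 130 identical residues) and the placeholder SDR sequences are assumptions; they stand in for SEQ ID NO:1 and for the sequences of table 2, neither of which is reproduced here.

```python
def insert_sdrs(scaffold: str, insertions: dict[int, str]) -> str:
    """Insert each SDR after the given residue number of the original scaffold.

    `insertions` maps N (1-based residue number) to the SDR sequence that is
    placed between residues N and N + 1. Insertions are applied from the
    C-terminal side first, so that all positions refer to the original numbering.
    """
    result = scaffold
    for after in sorted(insertions, reverse=True):
        if not 1 <= after < len(scaffold):
            raise ValueError(f"no residue pair {after}/{after + 1} in the scaffold")
        result = result[:after] + insertions[after] + result[after:]
    return result


# Placeholder scaffold and SDRs: SDR 1 (6 residues) between 42 and 43,
# SDR 2 (5 residues) between 123 and 124.
scaffold = "A" * 130
engineered = insert_sdrs(scaffold, {42: "SSSSSS", 123: "TTTTT"})
print(len(scaffold), len(engineered))
```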

In a further embodiment the protein scaffold belongs to the S8 structural 
subclass of serine proteases and/or has a tertiary structure similar to subtilisin E 
from Bacillus subtilis and/or has at least 70% identity on the amino acid level to 
a protein of the S8 structural subclass of serine proteases. Preferably, the 
scaffold belongs to the subtilisin family or the human pro-protein convertases. It 
is preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 6-17, 25-29, 47-55, 59-69, 101-111, 
117-125, 129-137, 139-154, 158-169, 185-195 and 204-225 in subtilisin E from 
Bacillus subtilis, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 59-69, 101-111, 129-137, 158-169 and 204-225 (numbering of amino 
acids according to SEQ ID NO: 7). It is preferred that the protein scaffold is equal 
to or is a derivative or homologue of one or more of the following proteins: 
subtilisin Carlsberg; B. subtilis subtilisin E; subtilisin BPN'; B. licheniformis 
subtilisin; B. lentus subtilisin; Bacillus alcalophilus alkaline protease; proteinase 
K; kexin; human pro-protein convertase; human furin. In a preferred variant, 
subtilisin BPN' or one of the proteins SPC 1 to 7 is used as the protein scaffold. 

In a further embodiment the protein scaffold belongs to the family of aspartic 
proteases and/or has a tertiary structure similar to human pepsin. Preferably, the 
scaffold belongs to the A1 class of proteases and/or has at least 70% identity on 
the amino acid level to a protein of the A1 class of proteases. It is preferred that 
SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 6-18, 49-55, 74-83, 91-97, 112-120, 126-137, 
159-164, 184-194, 242-247, 262-267 and 277-300 in human pepsin, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 10-15, 75-80, 
114-118, 130-134, 186-191 and 280-296 (numbering of amino acids according 
to SEQ ID NO: 11). It is preferred that the protein scaffold is equal to or is a 
derivative or homologue of one or more of the following proteins: pepsin, 
chymosin, renin, cathepsin, yapsin. Preferably, pepsin or endothiapepsin or a 
derivative or homologue thereof is used as the protein scaffold. 

In a further embodiment the protein scaffold belongs to the cysteine protease 
family and/or has a tertiary structure similar to human caspase 7. Preferably the 
scaffold belongs to the C14 class of cysteine proteases or has at least 70% 
identity on the amino acid level to a protein of the C14 class of cysteine 
proteases. It is preferred that SDRs are inserted into the protein scaffold at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 78-91, 144-160, 186-198, 
226-243 and 271-291 in human caspase 7, and more preferably at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 80-86, 149-157, 190-194 and 233-238 
(numbering of amino acids according to SEQ ID NO: 14). It is preferred that the 
protein scaffold is equal to or is a derivative or homologue of one of the caspases 
1 to 9. 

In a further embodiment the protein scaffold belongs to the S11 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S11 class of serine proteases and/or has a tertiary structure similar to 
D-alanyl-D-alanine transpeptidase from Streptomyces species K15. It is preferred 
that SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 67-79, 137-150, 191-206, 212-222 and 241-251 in 
D-alanyl-D-alanine transpeptidase from Streptomyces species K15, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 70-75, 141-147, 
195-202 and 216-220 (numbering of amino acids according to SEQ ID NO: 15). It 
is preferred that the D-alanyl-D-alanine transpeptidase from Streptomyces 
species K15 or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the S21 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S21 class of serine proteases and/or has a tertiary structure similar to assemblin 
from human cytomegalovirus. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 
25-33, 64-69, 134-155, 162-169 and 217-244 in assemblin from human 
cytomegalovirus, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 27-31, 164-168 and 222-239 (numbering of amino acids according to 
SEQ ID NO: 16). It is preferred that the assemblin from human cytomegalovirus 
or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the S26 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S26 class of serine proteases and/or has a tertiary structure similar to the signal 
peptidase from Escherichia coli. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 8-14, 
57-68, 125-134, 239-254, 200-211 and 228-239 in signal peptidase from 
Escherichia coli, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 9-13, 60-67, 127-132 and 203-209 (numbering of amino acids according 
to SEQ ID NO: 17). It is preferred that the signal peptidase from Escherichia coli 
or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the S33 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S33 class of serine proteases and/or has a tertiary structure similar to the prolyl 
aminopeptidase from Serratia marcescens. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 
47-54, 152-160, 203-212 and 297-302 in prolyl aminopeptidase from Serratia 
marcescens, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 50-53, 154-158 and 206-210 (numbering of amino acids according to 
SEQ ID NO: 18). It is preferred that the prolyl aminopeptidase from Serratia 
marcescens or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the S51 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S51 class of serine proteases and/or has a tertiary structure similar to aspartyl 
dipeptidase from Escherichia coli. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 8-16, 
38-46, 85-92, 132-140, 159-170 and 205-211 in aspartyl dipeptidase from 
Escherichia coli, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 10-14, 87-90, 134-138 and 160-165 (numbering of amino acids 
according to SEQ ID NO: 19). It is preferred that the aspartyl dipeptidase from 
Escherichia coli or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the A2 class of aspartic 
proteases or has at least 70% identity on the amino acid level to a protein of the 
A2 class of aspartic proteases and/or has a tertiary structure similar to the 
protease from human immunodeficiency virus. It is preferred that SDRs are 
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 5-12, 17-23, 27-30, 33-38 and 77-83 in protease from human 
immunodeficiency virus, and more preferably at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 7-10, 18-21, 34-37 and 79-82 (numbering of amino 
acids according to SEQ ID NO:20). It is preferred that the protease from human 
immunodeficiency virus, preferably HIV-1 protease, or a derivative or homologue 
thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the A26 class of 
aspartic proteases or has at least 70% identity on the amino acid level to a 
protein of the A26 class of aspartic proteases and/or has a tertiary structure 
similar to the omptin from Escherichia coli. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 28- 
40, 86-98, 150-168, 213-219 and 267-278 in omptin from Escherichia coli, and 
more preferably at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 33- 
38, 161-168 and 273-277 (numbering of amino acids according to SEQ ID 
NO:21). It is preferred that the omptin from Escherichia coli or a derivative or 
homologue thereof is used as the scaffold. 
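
Positions that "correspond by amino acid sequence homology" to a region of the reference scaffold can be located in a homologue by walking along a pairwise alignment. The sketch below assumes a pre-computed alignment with "-" as the gap symbol and 1-based residue numbering. The function name and the toy sequences are hypothetical, and structural correspondence, which the text also allows, is not covered.

```python
def map_region(aligned_ref: str, aligned_hom: str,
               start: int, end: int, gap: str = "-") -> list[int]:
    """Return 1-based homologue positions aligned to reference residues start..end.

    Reference residues aligned to a gap in the homologue have no counterpart
    and are skipped.
    """
    if len(aligned_ref) != len(aligned_hom):
        raise ValueError("aligned sequences must have equal length")
    ref_pos = 0  # residue counter in the reference (gaps not counted)
    hom_pos = 0  # residue counter in the homologue (gaps not counted)
    mapped = []
    for ref_res, hom_res in zip(aligned_ref, aligned_hom):
        if ref_res != gap:
            ref_pos += 1
        if hom_res != gap:
            hom_pos += 1
        if ref_res != gap and hom_res != gap and start <= ref_pos <= end:
            mapped.append(hom_pos)
    return mapped


# Toy alignment (not taken from any SEQ ID NO in this document).
reference = "MKT-AYIAK"
homologue = "M-TGAYLAK"
positions = map_region(reference, homologue, 2, 5)
```

Here reference residues 2 to 5 map to homologue residues 2, 4 and 5, because reference residue 2 faces a gap and homologue residue 3 is an insertion.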

In a further embodiment the protein scaffold belongs to the C1 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C1 class of cysteine proteases and/or has a tertiary structure similar to the 
papain from Carica papaya. It is preferred that SDRs are inserted into the protein 
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 17-24, 61-68, 
88-95, 135-142, 153-158 and 176-184 in papain from Carica papaya, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 63-66, 136-139 
and 177-181 (numbering of amino acids according to SEQ ID NO:22). It is 
preferred that the papain from Carica papaya or a derivative or homologue 
thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C2 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C2 class of cysteine proteases and/or has a tertiary structure similar to human 
calpain-2. It is preferred that SDRs are inserted into the protein scaffold at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 90-103, 160-172, 193-199, 243- 
260, 286-294 and 316-322 in human calpain-2, and more preferably at one or 
more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 92-101, 245-250 and 287-291 
(numbering of amino acids according to SEQ ID NO:23). It is preferred that the 
human calpain-2 or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C4 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C4 class of cysteine proteases and/or has a tertiary structure similar to NIa 
protease from tobacco etch virus. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 23-31, 
112-120, 144-150, 168-176 and 205-218 in NIa protease from tobacco etch 
virus, and more preferably at one or more positions from the group of positions 
that correspond structurally or by amino acid sequence homology to the regions 
145-149, 169-174 and 212-218 (numbering of amino acids according to SEQ ID 
NO:24). It is preferred that the NIa protease from tobacco etch virus (TEV 
protease) or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C10 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C10 class of cysteine proteases and/or has a tertiary structure similar to the 
streptopain from Streptococcus pyogenes. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 81-90, 
133-140, 150-164, 191-199, 219-229, 246-256, 306-312 and 330-337 in 
streptopain from Streptococcus pyogenes, and more preferably at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 82-87, 134-138, 250-254 and 331-335 
(numbering of amino acids according to SEQ ID NO:25). It is preferred that the 
streptopain from Streptococcus pyogenes or a derivative or homologue thereof is 
used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C19 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C19 class of cysteine proteases and/or has a tertiary structure similar to human 
ubiquitin specific protease 7. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 3-15, 
63-70, 80-86, 248-256, 272-283 and 292-304 in human ubiquitin specific 
protease 7, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 10-15, 251-255, 277-281 and 298-304 (numbering of amino acids 
according to SEQ ID NO:26). It is preferred that the human ubiquitin specific 
protease 7 or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C47 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C47 class of cysteine proteases and/or has a tertiary structure similar to the 
staphopain from Staphylococcus aureus. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 15- 
23, 57-66, 108-119, 142-149 and 157-164 in staphopain from Staphylococcus 
aureus, and more preferably at one or more positions from the group of positions 
that correspond structurally or by amino acid sequence homology to the regions 
17-22, 111-117, 143-147 and 159-163 (numbering of amino acids according to 
SEQ ID NO:27). It is preferred that the staphopain from Staphylococcus aureus 
or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C48 class of 
cysteine proteases or has at least 70% identity on the amino acid level to a 
protein of the C48 class of cysteine proteases and/or has a tertiary structure 
similar to the Ulp1 endopeptidase from Saccharomyces cerevisiae. It is preferred 
that SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 40-51, 108-115, 132-141, 173-179 and 597-605 in 
Ulp1 endopeptidase from Saccharomyces cerevisiae, and more preferably at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 43-49, 110-113, 133-137 and 
175-178 (numbering of amino acids according to SEQ ID NO:28). It is preferred 
that the Ulp1 endopeptidase from Saccharomyces cerevisiae or a derivative or 
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C56 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C56 class of cysteine proteases and/or has a tertiary structure similar to the PfpI 
endopeptidase from Pyrococcus horikoshii. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 8-16, 
40-47, 66-73, 118-125 and 147-153 in PfpI endopeptidase from Pyrococcus 
horikoshii, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 9-14, 68-71, 120-123 and 148-151 (numbering of amino acids according 
to SEQ ID NO:29). It is preferred that the PfpI endopeptidase from Pyrococcus 
horikoshii or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the M4 class of metallo 
proteases or has at least 70% identity on the amino acid level to a protein of the 
M4 class of metallo proteases and/or has a tertiary structure similar to 
thermolysin from Bacillus thermoproteolyticus. It is preferred that SDRs are 
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 106-118, 125-130, 152-160, 197-204, 210-213 and 221-229 in 
thermolysin from Bacillus thermoproteolyticus, and more preferably at one or 
more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 108-115, 126-129, 199-203 and 
223-227 (numbering of amino acids according to SEQ ID NO:30). It is preferred 
that the thermolysin from Bacillus thermoproteolyticus or a derivative or 
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the M10 class of metallo 
proteases or has at least 70% identity on the amino acid level to a protein of the 
M10 class of metallo proteases and/or has a tertiary structure similar to human 
collagenase. It is preferred that SDRs are inserted into the protein scaffold at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 2-7, 68-79, 85-90, 107-111 and 
135-141 in human collagenase, and more preferably at one or more positions 
from the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 3-6, 71-78 and 136-140 (numbering of amino 
acids according to SEQ ID NO:31). It is preferred that human collagenase or a 
derivative or homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have glycosidase activity. A 
particularly suited protein scaffold for this variant is a glycosylase or is derived 
from a glycosylase. Preferably, the tertiary structure belongs to one of the 
following structural classes: class GH13, GH7, GH12, GH11, GH10, GH28, GH26, 
and GH18 (beta/alpha)8 barrel. 

In a first embodiment the protein scaffold belongs to the GH13 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH13 class of glycosylases and/or has a tertiary structure similar to human 
pancreatic alpha-amylase. It is preferred that SDRs are inserted into the protein 
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 50-60, 100-110, 
148-167, 235-244, 302-310 and 346-359 in human pancreatic alpha-amylase, 
and more preferably at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 51- 
58, 148-155 and 303-309 (numbering of amino acids according to SEQ ID 
NO:32). It is preferred that human pancreatic alpha-amylase or a derivative or 
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH7 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH7 class of glycosylases and/or has a tertiary structure similar to cellulase 
from Trichoderma reesei. It is preferred that SDRs are inserted into the protein 
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 47-56, 93-104, 
173-182, 215-223, 229-236 and 322-334 in cellulase from Trichoderma reesei, 
and more preferably at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 175- 
180, 218-222 and 324-332 (numbering of amino acids according to SEQ ID 
NO:33). It is preferred that cellulase from Trichoderma reesei or a derivative or 
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH12 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH12 class of glycosylases and/or has a tertiary structure similar to cellulase 
from Aspergillus niger. It is preferred that SDRs are inserted into the protein 
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 18-28, 55-60, 
106-113, 126-132 and 149-159 in cellulase from Aspergillus niger, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 20-26, 56-59, 
108-112 and 151-156 (numbering of amino acids according to SEQ ID NO:34). It 
is preferred that cellulase from Aspergillus niger or a derivative or homologue 
thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH11 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH11 class of glycosylases and/or has a tertiary structure similar to xylanase 
from Aspergillus niger. It is preferred that SDRs are inserted into the protein 
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 7-14, 33-39, 88- 
97, 114-126 and 158-167 in xylanase from Aspergillus niger, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 20-26, 56-59, 
108-112 and 151-156 (numbering of amino acids according to SEQ ID NO:35). It 
is preferred that xylanase from Aspergillus niger or a derivative or homologue 
thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH10 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH10 class of glycosylases and/or has a tertiary structure similar to xylanase 
from Streptomyces lividans. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 21- 
29, 42-50, 84-92, 130-136, 206-217 and 269-278 in xylanase from 
Streptomyces lividans, and more preferably at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 43-49, 86-90, 208-213 and 271-276 (numbering of 
amino acids according to SEQ ID NO:36). It is preferred that xylanase from 
Streptomyces lividans or a derivative or homologue thereof is used as the 
scaffold. 

In a further embodiment the protein scaffold belongs to the GH28 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH28 class of glycosylases and/or has a tertiary structure similar to 
pectinase from Aspergillus niger. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 82- 
88, 118-126, 171-178, 228-236, 256-264 and 289-299 in pectinase from 
Aspergillus niger, and more preferably at one or more positions from the group 
of positions that correspond structurally or by amino acid sequence homology to 
the regions 116-124, 174-178 and 291-296 (numbering of amino acids according 
to SEQ ID NO:37). It is preferred that pectinase from Aspergillus niger or a 
derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH26 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH26 class of glycosylases and/or has a tertiary structure similar to 
mannanase from Pseudomonas cellulosa. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 75- 
83, 113-125, 174-182, 217-224, 247-254, 324-332 and 325-340 in mannanase 
from Pseudomonas cellulosa, and more preferably at one or more positions from 
the group of positions that correspond structurally or by amino acid sequence 
homology to the regions 115-123, 176-180, 286-291 and 328-337 (numbering of 
amino acids according to SEQ ID NO:38). It is preferred that mannanase from 
Pseudomonas cellulosa or a derivative or homologue thereof is used as the 
scaffold. 

In a further embodiment the protein scaffold belongs to the GH18 (beta/alpha)8 
barrel class of glycosylases or has at least 70% identity on the amino acid level 
to a protein of the GH18 class of glycosylases and/or has a tertiary structure 
similar to chitinase from Bacillus circulans. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 21- 
29, 57-65, 130-136, 176-183, 221-229, 249-257 and 327-337 in chitinase from 
Bacillus circulans, and more preferably at one or more positions from the group 
of positions that correspond structurally or by amino acid sequence homology to 
the regions 59-63, 178-181, 250-254 and 330-336 (numbering of amino acids 
according to SEQ ID NO:39). It is preferred that chitinase from Bacillus circulans 
or a derivative or homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have esterhydrolase activity. 
Preferably, the protein scaffold for this variant has lipase, phosphatase, 
phytase, or phosphodiesterase activity. 

In a first embodiment the protein scaffold belongs to the GX class of esterases or 
has at least 70% identity on the amino acid level to a protein of the GX class of 
esterases and/or has a tertiary structure similar to the structure of the lipase B 
from Candida antarctica. Preferably, the scaffold has lipase activity. It is 
preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 139-148, 188-195, 216-224, 256-266 and 
272-287 in lipase B from Candida antarctica, and more preferably at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 141-146, 218-222, 259-263 and 275-283 
(numbering of amino acids according to SEQ ID NO:40). It is preferred that 
lipase B from Candida antarctica or a derivative or homologue thereof is used as 
the scaffold. 

In a further embodiment the protein scaffold belongs to the GX class of esterases 
or has at least 70% identity on the amino acid level to a protein of the GX class 
of esterases and/or has a tertiary structure similar to the pancreatic lipase from 
guinea pig. Preferably, the scaffold has lipase activity. It is preferred that SDRs 
are inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 78-90, 91-100, 112-120, 179-186, 207-218, 238-247 and 248-260 in 
pancreatic lipase from guinea pig, and more preferably at one or more positions 
from the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 80-87, 114-118, 209-215 and 239-246 
(numbering of amino acids according to SEQ ID NO:41). It is preferred that 
pancreatic lipase from guinea pig or a derivative or homologue thereof is used as 
the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the alkaline phosphatase from Escherichia coli or has at least 
70% identity on the amino acid level to a protein that has a tertiary structure 
similar to the structure of the alkaline phosphatase from Escherichia coli. 
Preferably, the scaffold has phosphatase activity. It is preferred that SDRs are 
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 110-122, 187-142, 170-175, 186-193, 280-287 and 425-435 in alkaline 
phosphatase from Escherichia coli, and more preferably at one or more positions 
from the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 171-174, 187-191, 282-286 and 426-433 
(numbering of amino acids according to SEQ ID NO:42). It is preferred that 
alkaline phosphatase from Escherichia coli or a derivative or homologue thereof 
is used as the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the bovine pancreatic desoxyribonuclease I or has at least 70% 
identity on the amino acid level to a protein that has a tertiary structure similar 
to the structure of the bovine pancreatic desoxyribonuclease I. Preferably, the 
scaffold has phosphodiesterase activity. More preferably, a nuclease, and most 
preferably, an unspecific endonuclease or a derivative thereof is used as the 
scaffold. It is preferred that SDRs are inserted into the protein scaffold at one or 
more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 14-21, 41-47, 72-77, 97-111, 
135-143, 171-178, 202-209 and 242-251 in bovine pancreatic 
desoxyribonuclease I, and more preferably at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 16-19, 42-46, 136-141 and 172-176 (numbering of 
amino acids according to SEQ ID NO:43). It is preferred that bovine pancreatic 
desoxyribonuclease I or human desoxyribonuclease I or a derivative or 
homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzyme has transferase activity. A 
particularly suited protein scaffold for this variant is a glycosyl-, a phospho- or a 
methyltransferase, or is a derivative thereof. Particularly preferred protein 
scaffolds for this variant are glycosyltransferases or are derived from 
glycosyltransferases. The tertiary structure of the protein scaffold can be of any 
type. Preferably, however, the tertiary structure belongs to one of the following 
structural classes: GH13 and GT1. 

In a first embodiment the protein scaffold belongs to the GH13 class of 
transferases or has at least 70% identity on the amino acid level to a protein of 
the GH13 class of transferases and/or has a tertiary structure similar to the 
structure of the cyclomaltodextrin glucanotransferase from Bacillus circulans. 
Preferably, the scaffold has transferase activity, and more preferably a 
glycosyltransferase is used as the scaffold. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 38- 
48, 85-94, 142-154, 178-186, 259-266, 331-340 and 367-377 in 
cyclomaltodextrin glucanotransferase from Bacillus circulans, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 87-92, 180-185, 
261-264 and 269-275 (numbering of amino acids according to SEQ ID NO:44). It 
is preferred that cyclomaltodextrin glucanotransferase from Bacillus circulans or 
a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GT1 class of 
transferases or has at least 70% identity on the amino acid level to a protein of 
the GT1 class of transferases and/or has a tertiary structure similar to the 
structure of the glycosyltransferase from Amycolatopsis orientalis A82846. 
Preferably the scaffold has transferase activity, and more preferably 
glycosyltransferase activity. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 58- 
74, 130-138, 185-193, 228-236 and 314-323 in glycosyltransferase from 
Amycolatopsis orientalis A82846, and more preferably at one or more positions 
from the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 61-71, 230-234 and 316-321 (numbering of 
amino acids according to SEQ ID NO:45). It is preferred that the 
glycosyltransferase from Amycolatopsis orientalis A82846 or a derivative or 
homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have oxidoreductase activity. 
A particularly suited protein scaffold for this variant is a monooxygenase, a 
dioxygenase or an alcohol dehydrogenase, or a derivative thereof. The tertiary 
structure of the protein scaffold can be of any type. 

In a first embodiment the protein scaffold has a tertiary structure similar to the 
structure of the 2,3-dihydroxybiphenyl dioxygenase from Pseudomonas sp. or 
has at least 70% identity on the amino acid level to a protein that has a tertiary 
structure similar to the structure of the 2,3-dihydroxybiphenyl dioxygenase 
from Pseudomonas sp. Preferably, the scaffold has dioxygenase activity. It is 
preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 172-185, 198-206, 231-237, 250-259 
and 282-287 in 2,3-dihydroxybiphenyl dioxygenase from Pseudomonas sp., and 
more preferably at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 175- 
182, 200-204, 252-257 and 284-287 (numbering of amino acids according to 
SEQ ID NO:46). It is preferred that the 2,3-dihydroxybiphenyl dioxygenase 
from Pseudomonas sp. or a derivative or homologue thereof is used as the 
scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the catechol dioxygenase from Acinetobacter sp. or has at least 
70% identity on the amino acid level to a protein that has a tertiary structure 
similar to the structure of the catechol dioxygenase from Acinetobacter sp. 
Preferably, the scaffold has dioxygenase activity, and more preferably catechol 
dioxygenase activity. It is preferred that SDRs are inserted into the protein 
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 66-72, 105-112, 
156-171 and 198-207 in catechol dioxygenase from Acinetobacter sp., and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 107-110, 161- 
171 and 201-205 (numbering of amino acids according to SEQ ID NO:47). It is 
preferred that the catechol dioxygenase from Acinetobacter sp. or a derivative or 
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the camphor-5-monooxygenase from Pseudomonas putida or has 
at least 70% identity on the amino acid level to a protein that has a tertiary 
structure similar to the structure of the camphor-5-monooxygenase from 
Pseudomonas putida. Preferably, the scaffold has monooxygenase activity, and 
more preferably camphor monooxygenase activity. It is preferred that SDRs are 
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 26-31, 57-63, 84-98, 182-191, 242-256, 292-299 and 392-399 in 
camphor-5-monooxygenase from Pseudomonas putida, and more preferably at 
one or more positions from the group of positions that correspond structurally or 
by amino acid sequence homology to the regions 85-96, 183-188, 244-253, 293-298 
and 393-398 (numbering of amino acids according to SEQ ID NO:48). It is 
preferred that the camphor-5-monooxygenase from Pseudomonas putida or a 
derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the alcohol dehydrogenase from Equus caballus or has at least 
70% identity on the amino acid level to a protein that has a tertiary structure 
similar to the structure of the alcohol dehydrogenase from Equus caballus. 
Preferably, the scaffold has alcohol dehydrogenase activity. It is preferred that 
SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 49-63, 111-112, 294-301 and 361-369 in alcohol 
dehydrogenase from Equus caballus, and more preferably at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 51-61 and 295-299 (numbering of amino 
acids according to SEQ ID NO:49). It is preferred that the alcohol dehydrogenase 
from Equus caballus or a derivative or homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have lyase activity. A 
particularly suited protein scaffold for this variant is an oxoacid lyase or is a 
derivative thereof. Particularly preferred protein scaffolds for this variant are 
aldolases or synthases, or are derived therefrom. The tertiary structure of the 
protein scaffold can be of any type, but a (beta/alpha)8 barrel structure is 
preferred. 

In a first embodiment the protein scaffold has a tertiary structure similar to the 
structure of the N-acetyl-D-neuraminic acid aldolase from Escherichia coli or has at 
least 70% identity on the amino acid level to a protein that has a tertiary 
structure similar to the structure of the N-acetyl-D-neuraminic acid aldolase from 
Escherichia coli. Preferably, the scaffold has aldolase activity. It is preferred that 
SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 45-55, 78-87, 105-113, 137-146, 164-171, 187-193, 
205-210, 244-255 and 269-276 in N-acetyl-D-neuraminic acid aldolase from 
Escherichia coli, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 45-52, 138-144, 189-192, 247-253 and 271-275 (numbering of amino 
acids according to SEQ ID NO:50). It is preferred that the N-acetyl-D-neuraminic 
acid aldolase from Escherichia coli or a derivative or homologue thereof is used 
as the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the tryptophan synthase from Salmonella typhimurium or has at 
least 70% identity on the amino acid level to a protein that has a tertiary 
structure similar to the structure of the tryptophan synthase from Salmonella 
typhimurium. Preferably, the scaffold has synthase activity. It is preferred that 
SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 56-63, 127-134, 154-161, 175-193, 209-216 and 230- 
240 in tryptophan synthase from Salmonella typhimurium, and more preferably 
at one or more positions from the group of positions that correspond structurally 
or by amino acid sequence homology to the regions 57-62, 155-160, 178-190 
and 210-215 (numbering of amino acids according to SEQ ID NO:51). It is 
preferred that the tryptophan synthase from Salmonella typhimurium or a 
derivative or homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have isomerase activity. A 
particularly suited protein scaffold for this variant is an isomerase that converts 
aldoses or ketoses, or is a derivative thereof. 

In a first embodiment, the protein scaffold has a tertiary structure similar to the 
structure of the xylose isomerase from Actinoplanes missouriensis or has at least 
70% identity on the amino acid level to a protein that has a tertiary structure 
similar to the structure of the xylose isomerase from Actinoplanes missouriensis. 
It is preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 18-31, 92-103, 136-147, 178-188 and 
250-257 in xylose isomerase from Actinoplanes missouriensis, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 20-27, 92-99 
and 180-186 (numbering of amino acids according to SEQ ID NO:52). It is 
preferred that the xylose isomerase from Actinoplanes missouriensis or a 
derivative or homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have ligase activity. A 
particularly suited protein scaffold for this variant is a DNA ligase, or is a 
derivative thereof. 

In a first embodiment, the protein scaffold has a tertiary structure similar to the 
structure of the DNA ligase from Bacteriophage T7 or has at least 70% identity 
on the amino acid level to a protein that has a tertiary structure similar to the 
structure of the DNA-ligase from Bacteriophage T7. It is preferred that SDRs are 
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 52-60, 94-108, 119-131, 241-248, 255-263 and 302-318 in DNA ligase 
from Bacteriophage T7, and more preferably at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 96-106, 121-129, 256-262 and 304-316 (numbering of 
amino acids according to SEQ ID NO: 53). It is preferred that the DNA ligase from 
Bacteriophage T7 or a derivative or homologue thereof is used as the scaffold. 

A third aspect of the invention is directed to a method for generating engineered 
enzymes with specificities that are qualitatively and/or quantitatively novel in 
combination with the protein scaffold. The inventive method comprises at least 
the following steps: 

(a) providing a protein scaffold capable of catalyzing at least one chemical 
reaction on at least one target substrate, 

(b) generating a library of engineered enzymes or isolated engineered 
enzymes by combining the protein scaffold from step (a) with one or more 
fully or partially random peptide sequences at sites in the protein scaffold that 
enable the resulting engineered enzyme to discriminate between at least one 
target substrate and one or more different substrates and 

(c) selecting out of the library of engineered enzymes generated in step (b) 
one or more enzymes that have defined specificities towards at least one 
target substrate. 

In a first variant of this aspect of the invention, the inventive method comprises 
at least the following steps: 

(a) providing a protein scaffold capable of catalyzing at least one chemical 
reaction on at least one target substrate, 

(b) generating a library of engineered enzymes or isolated engineered 
enzymes by inserting into the protein scaffold from step (a) one or more fully 
or partially random peptide sequences at sites in the protein scaffold that 
enable the resulting engineered enzyme to discriminate between at least one 
target substrate and one or more different substrates and 

(c) selecting out of the library of engineered enzymes generated in step (b) 
one or more enzymes that have defined specificities towards at least one 
target substrate. 

Preferably, the positions at which the one or more fully or partially random 
peptide sequences are combined with or inserted into the protein scaffold are 
identified prior to the combination or insertion. 

The number of insertions or other combinations of fully or partially random 
peptide sequences as well as their length may vary over a wide range. The 
number is at least one, preferably more than one, more preferably between two 
and eleven, most preferably between two and six. The length of such fully or 
partially random peptide sequences is usually less than 50 amino acid residues. 
Preferably, the length is between one and 15 amino acid residues, more 
preferably between one and six amino acid residues. Alternatively, the length is 
between two and 20 amino acid residues, preferably between two and ten amino 
acid residues, more preferably between three and eight amino acid residues. 

Preferably such insertions or other combinations are performed on the DNA level, 
using polynucleotides encoding such protein scaffolds and polynucleotides or 
oligonucleotides encoding such fully or partially random peptide sequences. 
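
As an illustration of how a randomized position can be encoded on the DNA level, the following minimal sketch expands a degenerate codon into its concrete codons and counts stop codons and encoded amino acids. It compares a fully degenerate NNN codon with NN(G/T/C), the mixture that this description names further on as preferred. The standard genetic code is assumed, and the helper names `expand` and `summarize` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def expand(first, second, third):
    """Return all codons allowed by the base mixture at each codon position."""
    return ["".join(c) for c in product(first, second, third)]


def summarize(codons):
    """Return (number of codons, number of stop codons, number of amino acids)."""
    encoded = [CODON_TABLE[c] for c in codons]
    return len(codons), encoded.count("*"), len(set(encoded) - {"*"})


nnn = summarize(expand("ACGT", "ACGT", "ACGT"))  # fully degenerate codon
nnb = summarize(expand("ACGT", "ACGT", "GTC"))   # NN(G/T/C)
print("NNN      :", nnn)
print("NN(G/T/C):", nnb)
```

Under these assumptions the restricted third position still covers all twenty amino acids while leaving TAG as the only stop codon.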

Optionally, steps (a) to (c) are repeated cyclically, whereby enzymes selected in 
step (c) serve as the protein scaffold in step (a) of a further cycle, and 
randomized peptide sequences are either inserted or, alternatively, substituted 
for peptide sequences that have been inserted in former cycles. Thereby, the 
number of inserted peptide sequences is either constant or increases over the 
cycles. The cycles are repeated until one or more enzymes with the intended 
specificities are generated. 
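
The cyclic repetition of steps (a) to (c) can be sketched as a simple loop in which the variant selected in one cycle becomes the scaffold of the next. This is a toy model only: sequences are plain strings, the library step re-randomizes one position of an already inserted stretch, and `toy_score` is a hypothetical stand-in for a real specificity assay. The scaffold sequence, the `IDEAL` stretch and all function names are assumptions made for illustration.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def make_library(parent, start, length, size, rng):
    """Step (b), toy version: re-randomize one position of the inserted stretch."""
    library = []
    for _ in range(size):
        pos = start + rng.randrange(length)
        library.append(parent[:pos] + rng.choice(AMINO_ACIDS) + parent[pos + 1:])
    return library


def run_cycles(scaffold, score, start, length, target,
               max_cycles=10, size=50, seed=0):
    """Repeat steps (a)-(c); the selected variant becomes the next scaffold."""
    rng = random.Random(seed)
    parent, history = scaffold, [score(scaffold)]
    for _ in range(max_cycles):
        if history[-1] >= target:
            break                                  # intended specificity reached
        library = make_library(parent, start, length, size, rng)
        best = max(library, key=score)             # step (c): selection
        if score(best) > score(parent):
            parent = best                          # step (a) of the next cycle
        history.append(score(parent))
    return parent, history


IDEAL = "DDDK"  # hypothetical stretch that the toy assay rewards


def toy_score(seq, start=3):
    """Hypothetical assay: number of stretch positions matching IDEAL."""
    return sum(a == b for a, b in zip(seq[start:start + 4], IDEAL))


final, history = run_cycles("MKTGGGGLVA", toy_score, start=3, length=4, target=4)
print(final, history)
```

The loop only ever replaces the current scaffold by a better-scoring variant, so the recorded score never decreases from cycle to cycle.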

Moreover, during or after one or more rounds of steps (a) to (c), the scaffold 
may be mutated at one or more positions in order to make the scaffold more 
acceptable for the combination with SDR sequences, and/or to increase catalytic 
activity at a specific pH and temperature, and/or to change the glycosylation 
pattern, and/or to decrease sensitivity towards enzyme inhibitors, and/or to 
change enzyme stability. 

In a second variant of this aspect of the invention, the inventive method 
comprises at least the following steps: 

(a) providing a first protein scaffold fragment, 

(b) connecting said protein scaffold fragment via a peptide linkage with a first 
SDR, and optionally 

(c) connecting the product of step (b) via a peptide linkage with a further SDR 
peptide or with a further protein scaffold fragment, and optionally 

(d) repeating step (c) for as many cycles as necessary in order to generate a 
sufficiently specific enzyme, and 

(e) selecting out of the population generated in steps (a) - (d) one or more 
enzymes that have the desired specificities toward the one or more target 
substrates. 

Protein scaffold fragment means a part of the sequence of a protein scaffold. A 
protein scaffold is comprised of at least two protein scaffold fragments. 
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In a third variant of this aspect of the invention, the protein scaffold, the SDRs 
and the engineered enzyme are encoded by a DNA sequence and an expression 
system is used in order to produce the protein. In an alternative variant, the 
protein scaffold, the SDRs and/or the engineered enzyme are chemically 
synthesized from peptide building blocks. 

In a fourth variant of this aspect of the invention, the inventive method 
comprises at least the following steps: 

(a) providing a polynucleotide encoding a protein scaffold capable of catalyzing 
one or more chemical reactions on one or more target substrates; 

(b) combining one or more fully or partially random oligonucleotide sequence 
with the polynucleotide encoding the protein scaffold, the fully or partially 
random oligonucleotide sequences being located at sites in the polynucleotide 
that enable the encoded engineered enzyme to discriminate between the one or 
more target substrates and one or more other substrates; and 

(c) selecting out of the population generated in step (b) one or more 
polynucleotides that encode enzymes that have the defined specificities toward 
the one or more target substrates. 

Any enzyme can serve as the protein scaffold in step (a). It can be a naturally 
occurring enzyme, a variant or a truncated derivative thereof, or an engineered 
enzyme. For human therapeutic use, the protein scaffold is preferably a 
mammalian enzyme, and more preferably a human enzyme. In that aspect, the 
invention is directed to a method for the generation of essentially mammalian, 
especially of essentially human enzymes with specificities that are different from 
specificities of any enzyme encoded in mammalian genomes or in the human 
genome, respectively. 

According to the invention, the protein scaffold provided in step (a) of this aspect 
must be capable of catalyzing one or more chemical reactions on a target 
substrate. Therefore, a protein scaffold is selected from the group of potential 
protein scaffolds by its activity on the target substrate. 

In a preferred variant of this aspect of the invention, a protein scaffold with 
hydrolase activity is used. Preferably, a protein scaffold with proteolytic activity 
is used, and more preferably, a protease with very low specificity having basic 
activity on the target substrate is used as the protein scaffold. Examples of 
proteases from different structural classes with low substrate specificity are 
Papain, Trypsin, Chymotrypsin, Subtilisin, SET (trypsin-like serine protease from 
Streptomyces erythraeus), Elastase, Cathepsin G or Chymase. Before being 
employed as the protein scaffold, the amino acid sequence of the protease may 
be modified in order to change protein properties other than specificity, e.g. 
catalytic activity, stability, inhibitor sensitivity, or expression yield, essentially as 
described in WO 92/18645, or in order to change specificity, essentially as 
described in EP 02020576.3 and PCT/EP03/04864. 

Lipases are another feasible option for the protein scaffold. Hepatic lipase, 
lipoprotein lipase and pancreatic lipase belong to the "lipoprotein lipase 
superfamily", which in turn is an example of the GX-class of lipases (M. Fischer, 
J. Pleiss (2003), Nucl. Acid. Res., 31, 319-321). The substrate specificity of 
lipases can be characterized by their relative activity towards triglycerol esters of 
fatty acids and phospholipids, bearing a charged head group. Alternatively, other 
hydrolases such as esterases, glycosylases, amidases, or nitrilases may be used 
as scaffolds. 

Transferases are also feasible protein scaffolds. Glycosyltransferases are involved 
in many biological syntheses involving a variety of donors and acceptors. 
Alternatively, the protein scaffold may have ligase, lyase, oxidoreductase, or 
isomerase activity. 

In a first embodiment, the one or more fully or partially random peptide 
sequences are inserted at specific sites in the protein scaffold. These insertion 
sites are characterized by the fact that the inserted peptide sequences can act as 
discriminators between different substrates, i.e. as Specificity Determining 
Regions or SDRs. Such insertion sites can be identified by several approaches. 
Preferably, insertion sites are identified by analysis of the three-dimensional 
structure of the protein scaffolds, by comparative analysis of the primary 
sequences of the protein scaffold with other enzymes having different 
quantitative specificities, or experimentally by techniques such as alanine 
scanning, random mutagenesis, or random deletion, or by any combination 
thereof. 

A first approach to identify insertion sites for SDRs is based on the three- 
dimensional structure of the protein scaffold as it can be obtained by x-ray 
crystallography or by nuclear magnetic resonance studies. Structural alignment 
of the protein scaffold in comparison with other enzymes of the same structural 
class but having different quantitative specificities reveals regions of high 
structural similarity and regions with low structural similarity. Such an analysis 
can for example be done using public software such as Swiss PDB viewer (Guex, 
N. and Peitsch, M.G. (1997) Electrophoresis 18, 2714-2723). Regions of low 
structural similarity are preferred SDR insertion sites. 

In a second approach to identify insertion sites for SDRs, three-dimensional 
structures of the scaffold protein in complex with competitive inhibitors or 
substrate analogs are analysed. It is assumed that the binding site of a 
competitive inhibitor significantly overlaps with the binding site of the substrate. 
In that case, atoms of the protein that are within a certain distance of atoms of 
the inhibitor are likely to be at a similar distance from the substrate as well. 
Choosing a short distance, e.g. < 5 Å, will result in an ensemble of protein atoms 
that are in close contact with the substrate. These residues would constitute the 
first shell contacts and are therefore preferred insertion sites for SDRs. Once first 
shell contacts have been identified, second shell contacts can be found by 
repeating the distance analysis starting from first shell atoms. In yet another 
alternative of the invention the distance analysis described above is performed 
starting from the active site residues. 
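
The distance analysis described above can be sketched as follows. This is a minimal illustration, assuming that atom coordinates in Å have already been extracted from a structure of the scaffold in complex with an inhibitor. The residue numbers, the coordinates and the function names `shell_contacts` and `second_shell` are hypothetical.

```python
from math import dist


def shell_contacts(protein_atoms, probe_coords, cutoff=5.0):
    """Residues with at least one atom closer than `cutoff` to any probe atom."""
    return {
        res_id
        for res_id, xyz in protein_atoms
        if any(dist(xyz, p) < cutoff for p in probe_coords)
    }


def second_shell(protein_atoms, first_shell, cutoff=5.0):
    """Repeat the distance analysis starting from the first-shell atoms."""
    first_coords = [xyz for res_id, xyz in protein_atoms if res_id in first_shell]
    return shell_contacts(protein_atoms, first_coords, cutoff) - first_shell


# Toy data: (residue number, (x, y, z)); one atom per residue for brevity
protein_atoms = [
    (57, (0.0, 0.0, 0.0)),
    (102, (3.0, 0.0, 0.0)),
    (195, (7.0, 0.0, 0.0)),
    (214, (20.0, 0.0, 0.0)),
]
inhibitor_atoms = [(1.0, 0.0, 0.0)]

first = shell_contacts(protein_atoms, inhibitor_atoms)
second = second_shell(protein_atoms, first)
print(sorted(first), sorted(second))
```

Passing the coordinates of active site residues instead of inhibitor atoms as `probe_coords` gives the alternative analysis mentioned at the end of the paragraph.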

In a third approach to identify insertion sites for SDRs, the primary sequence of 
the scaffold protein is aligned with other enzymes of the same structural class 
but having different quantitative specificities using an alignment algorithm. 
Examples of such alignment algorithms are published (Altschul, S.F., Gish, W., 
Miller, W., Myers, E.W. & Lipman, D.J. (1990) J. Mol. Biol. 215:403-410; 
"Statistical methods in Bioinformatics: an introduction" by Ewens, W. & Grant, 
G.R. 2001, Springer, New York). Such an alignment may reveal conserved and 
non-conserved regions with varying sequence homology, and, in particular, 
additional sequence elements in one or more enzymes compared to the scaffold 
protein. Conserved regions are more likely to contribute to phenotypes shared 
among the different proteins, e.g. stabilizing the three-dimensional fold. Non- 
conserved regions and, in particular, additional sequences in enzymes with 
quantitatively higher specificity (Turner, R. et al. (2002) J. Biol. Chem., 277, 
33068-33074) are preferred insertion sites for SDRs. 

For proteases currently five families are known, namely aspartic-, cysteine-, 
serine-, metallo- and threonine proteases. Each family includes groups of 
proteases that share a similar fold. Crystallographic structures of members of 
these groups have been solved and are accessible through public databases, e.g. 
the Brookhaven protein database (H.M. Berman et al. Nucleic Acids Research, 28 
pp. 235-242 (2000)). Such databases also include structural homologs in other 
enzyme classes and nonenzymatically active proteins of each class. Several tools 
are available to search public databases for structural homologues: SCOP - a 
structural classification of proteins database for the investigation of sequences 
and structures. (Murzin A. G. et al. (1995) J. Mol. Biol. 247, 536-540); CATH - 
Class, Architecture, Topology and Homologous superfamily: a hierarchical 
classification of protein domain structures (Orengo et al. (1997) Structure 5(8) 
1093-1108); FSSP - Fold classification based on structure-structure alignment of 
proteins (Holm and Sander (1998) Nucl. Acids Res. 26 316-319); or VAST - 
Vector alignment search tool (Gibrat, Madej and Bryant (1996) Current Opinion 
in Structural Biology 6, 377-385). 

In the above described approaches, members of structural classes are compared 
in order to identify insertion sites for SDRs. 

In a preferred variant of these approaches serine proteases of the structural 
class S1 are compared with each other. Trypsin represents a member with low 
substrate specificity, as it requires only an arginine or lysine residue at the P1 
position. On the other hand, thrombin, tissue-type plasminogen activator or 
enterokinase all have a high specificity towards their substrate sequences, i.e. 
(L/I/V/F)XPR^NA, CPGR^VVGG and DDDK^, respectively (Perona, J. & Craik, C. 
(1997) J. Biol. Chem., 272, 29987-29990; Perona, J. & Craik, C (1995) Protein 
Science, 4, 337-360). An alignment of the amino acid sequences of these 
proteases is described in example 1 (Figure 2) along with the identification of 
SDRs. 
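
The difference between low and high specificity can be made concrete with a small sketch that searches a substrate sequence for the recognition sequences listed above. It assumes that the scissile bond lies directly after the arginine or lysine residue, with any following residues acting as a required context. The regular expressions are a simplified reading of the motifs, and the test sequence is hypothetical.

```python
import re

# Residues up to the scissile bond are matched; residues after it are a lookahead.
PATTERNS = {
    "trypsin": r"[RK]",
    "thrombin": r"[LIVF].PR(?=NA)",
    "t-PA": r"CPGR(?=VVGG)",
    "enterokinase": r"DDDK",
}


def cleavage_sites(sequence, pattern):
    """Return indices i where the bond between sequence[i-1] and sequence[i] is cut."""
    return [m.end() for m in re.finditer(pattern, sequence)]


substrate = "MARDDDDKSGLKPRNAQ"  # hypothetical test sequence
sites = {name: cleavage_sites(substrate, p) for name, p in PATTERNS.items()}
for name, found in sites.items():
    print(f"{name:13s} {found}")
```

In this toy substrate the trypsin pattern matches at four positions, whereas each of the longer recognition sequences matches at most once, which is the quantitative sense of specificity used in the comparison.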

A further example within the family of serine proteases is given by members of 
the structural class S8 (subtilisin fold). Subtilisin is the type protease for this 
class and represents an unspecific protease (Ottesen, M. & Svendsen, A. (1998) 
Methods Enzymol. 19, 199-215). Furin, PC1 and PC5 are proteases of the same 
structural class involved in the processing of propeptides and have a high 
substrate specificity (Seidah, N. & Chretien, M. (1997) Curr. Opin. Biotech., 8: 
602-607; Bergeron, F. et al. (2000) J. Mol. Endocrin., 24:1-22). In a preferred 
variant of the approach alignments of the primary amino acid sequences (Figure 
4) are used to identify eleven sequence stretches longer than three amino acids 
which specific proteases have in addition compared to subtilisin and are therefore 
potential specificity determining regions. In a further variant of the approach 
information from the three-dimensional structure of subtilisin can be used in 
order to further narrow down the selection (Figure 3). Out of the eleven inserted 
sequence stretches, three are especially close to the active site residues, namely 
stretch number 7, 8 and 11 which are insertions in PC5, PC1 and all three 
specific proteases, respectively (Figure 3). In a preferred variant, one or several 
amino acid stretches of variable length and composition can be inserted into the 
subtilisin sequence at one or several of the eleven positions. In a more preferred 
variant of the approach the insertion is performed at regions 7, 8 or 11 or any 
combination thereof. In another preferred variant of the approach protease 
scaffolds other than subtilisin from the structural class S8 are used. 
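
The search for sequence stretches that a specific protease has in addition to the unspecific scaffold can be sketched on a pairwise alignment as follows. The sketch assumes two already aligned rows with `-` as the gap character and reports every run of more than three residues that faces gaps in the scaffold row. The aligned sequences are toy data, not the sequences of Figure 4, and the function name is hypothetical.

```python
def inserted_stretches(scaffold_row, specific_row, min_len=4):
    """Return (scaffold residues preceding the insertion, inserted stretch) pairs."""
    if len(scaffold_row) != len(specific_row):
        raise ValueError("aligned rows must have equal length")
    found, current, residues_seen = [], "", 0
    # A sentinel column closes a stretch that runs to the end of the alignment.
    for s, t in zip(scaffold_row + "X", specific_row + "X"):
        if s == "-" and t == "-":
            continue                      # column empty in both rows
        if s == "-":
            current += t                  # residue present only in the specific enzyme
            continue
        if len(current) >= min_len:
            found.append((residues_seen, current))
        current = ""
        residues_seen += 1
    return found


scaffold = "ACDE----FGHIK--LM"   # toy unspecific scaffold, aligned
specific = "ACDEWWYYFGH-KPPLM"   # toy specific enzyme, aligned
stretches = inserted_stretches(scaffold, specific)
print(stretches)
```

The two-residue insertion near the end is ignored because only stretches longer than three amino acids are counted as candidate specificity determining regions.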

In a further preferred variant of this approach, aspartic acid proteases of the 
structural class A1 are analyzed (Rawlings, N.D. & Barrett, A.J. (1995). Methods 
Enzymol. 248, 105-120; Chitpinityol, S. & Crabbe, M.J. (1998), Food Chemistry, 
61, 395-418). Examples for the A1 structural class of aspartic proteases are 
pepsin, with a low substrate specificity, as well as beta-secretase (Gruninger-Leitch, F., et al. (2002) J. 
Biol. Chem. 277, 4687-4693) and renin (Wang, W. & Liang, T.C. (1994) 
Biochemistry, 33, 14636-14641), with relatively high substrate specificities. 
Retroviral proteases also belong to this class, although the active enzyme is a 
dimer of two identical subunits. The viral proteases are essential for the correct 
processing of the polyprotein precursor to generate functional proteins which 
requires a high substrate specificity in each case (Wu, J. et al. (1998) 
Biochemistry, 37, 4518-4526; Pettit, S. et al. (1991) J. Biol. Chem., 266, 14539- 
14547). Pepsin is the type protease for this class and represents an unspecific 
protease (Kageyama, T. (2002) Cell. Mol. Life Sci. 59, 288-306). Beta-secretase and 
Cathepsin D (Aguilar, C. F. et al. (1995) Adv. Exp. Med. Biol. 362, 155-166) are 
proteases of the same structural class and have a high substrate specificity. In a 
preferred variant of the approach alignments of the primary amino acid 
sequences (Figure 6) are used to identify six sequence stretches longer than 
three amino acids which are inserted in the specific proteases compared to 
pepsin and are therefore potential specificity determining regions. In a further 
variant of the approach information from the three-dimensional structure of 
beta-secretase can be used in order to further narrow down the selection. Out of the 
six inserted sequence stretches, three are especially close to the active site 
residues, namely stretch number 1, 3 and 4 which are insertions in cathepsin D 
and beta-secretase, respectively (Figure 5). In a preferred variant of the 
approach, one or several amino acid stretches of variable length and composition 
can be inserted into the pepsin sequence at one or several of the six positions. In 
a more preferred embodiment of the invention the insertion is performed at the 
positions 1, 3 or 4 or any combination thereof. In another preferred embodiment 
of the invention protease scaffolds other than pepsin are used. 

There are cases where a certain structural class does not include known 
members of low and high specificity. This is exemplified by the C14 class of 
caspases which belong to the cysteine protease family (Rawlings, N.D. & Barrett, 
A.J. (1994) Methods Enzymol. 244, 461-486) and which all show high specificity 
for P4 to P1 positions. For example, caspase-1, caspase-3 and caspase-9 
recognize the sequences YVAD^, DEVD^ or LEHD^, respectively. Identification 
of the regions that differ between the caspases will include the regions 
responsible for the differences in substrate specificity (Figures 7 and 8). 

Finally, non-enzymatic proteins of the same fold as the enzyme scaffold may also 
contribute to the identification of insertion sites for SDRs. For example, 
haptoglobin (Arcoleo, J. & Greer, J. (1982) J. Biol. Chem. 257, 10063-10068) 
and azurocidin (Almeida, R. et al. (1991) Biochem. Biophys. Res. Commun. 177, 
688-695) share the same chymotrypsin-like fold with all S1 proteases. Due to 
substitutions in the active site residues these proteins do not possess any 
proteolytic function, yet they show high homology with active proteases. 
Differences between these proteins and specific proteases include regions that 
can serve as insertion sites for SDRs. 

In a fourth approach, insertion sites for SDRs are identified experimentally by 
techniques such as alanine scanning, random mutagenesis, random insertion or 
random deletion. In contrast to the approach disclosed above, this approach 
does not require detailed knowledge about the three-dimensional structure of the 
scaffold protein. In one preferred variant of this approach, random mutagenesis 
of enzymes with relatively high specificity from the same structural class as the 
protein scaffold and screening for loss or change of specificity can be used to 
identify insertion sites for SDRs in the protein scaffold. 

Random mutagenesis, alanine scanning, random insertion or random deletion are 
all done on the level of the polynucleotides encoding the enzymes. There are a 
variety of protocols known in the literature (e.g. Sambrook, J.F; Fritsch, E.F.; 
Maniatis, T.; Cold Spring Harbor Laboratory Press, Second Edition, 1989, New 
York). For example, random mutagenesis can be achieved by the use of a 
polymerase as described in patent WO 9218645. According to this patent, the 
one or more genes encoding the one or more proteases are amplified by use of a 
DNA polymerase with a high error rate or under conditions that increase the rate 
of misincorporations. For example the method of Cadwell and Joyce can be 
employed (Cadwell, R.C. and Joyce, G.F., PCR methods. Appl. 2 (1992) 28-33). 
Other methods of random mutagenesis such as, but not limited to, the use of 
mutator strains, chemical mutagens or UV-radiation can be employed as well. 

Alternatively, oligonucleotides can be used for mutagenesis that substitute 
randomly distributed amino acid residues with an alanine. This method is 
generally referred to as alanine scanning mutagenesis (Fersht, A.R. Biochemistry 
(1989) 8031-8036). As a further alternative, modifications of the alanine 
scanning mutagenesis such as binomial mutagenesis (Gregoret, L.M. and Sauer, 
R.T. PNAS (1993) 4246-4250) or combinatorial alanine scanning (Weiss et al., 
PNAS (2000) 8950-8954) can be employed. 
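
On the sequence level, the set of variants probed by alanine scanning can be enumerated with a few lines of code. The sketch below is a simplified, systematic version that generates every single-residue alanine substitution of a given protein sequence, rather than the randomly distributed substitutions obtained with oligonucleotide mixtures. The function name and the example sequence are hypothetical.

```python
def alanine_scan(sequence, positions=None):
    """Return (label, variant) pairs for single alanine substitutions."""
    if positions is None:
        positions = range(len(sequence))
    variants = []
    for i in positions:
        wild_type = sequence[i]
        if wild_type == "A":
            continue  # position is already alanine, nothing to probe
        label = f"{wild_type}{i + 1}A"   # e.g. S3A, using 1-based numbering
        variants.append((label, sequence[:i] + "A" + sequence[i + 1:]))
    return variants


scan = alanine_scan("GASH")
for label, variant in scan:
    print(label, variant)
```

Variants whose specificity is lost or changed in a subsequent assay would point to residues, and thus regions, that are candidate insertion sites.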

In order to express engineered enzymes, the DNA encoding such engineered 
proteins is ligated into a suitable expression vector by standard molecular cloning 
techniques (e.g. Sambrook, J.F; Fritsch, E.F.; Maniatis, T.; Cold Spring Harbor 
Laboratory Press, Second Edition, 1989, New York). The vector is introduced in a 
suitable expression host cell, which expresses the corresponding engineered 
enzyme variant. Particularly suitable expression hosts are bacterial expression 
hosts such as Escherichia coli or Bacillus subtilis, or yeast expression hosts such 
as Saccharomyces cerevisiae or Pichia pastoris, or mammalian expression hosts 
such as Chinese Hamster Ovary (CHO) or Baby Hamster Kidney (BHK) cell lines, 
or viral expression systems such as bacteriophages like M13 or Lambda, or 
viruses such as the Baculovirus expression system. As a further alternative, 
systems for in vitro protein expression can be used. Typically, the DNA is ligated 
into an expression vector behind a suitable signal sequence that leads to 
secretion of the enzyme variants into the extracellular space, thereby allowing 
direct detection of protease activity in the cell supernatant. Particularly suitable 
signal sequences for Escherichia coli are HlyA, for Bacillus subtilis AprE, NprB, 
Mpr, AmyA, AmyE, Blac, SacB, and for S. cerevisiae Bar1, Suc2, Matα, Inu1A, 
Ggplp. Alternatively, the enzyme variants are expressed intracellularly and the 
substrates are expressed also intracellularly. Preferably, this is done essentially 
as described in patent application WO 0212543, using a fusion peptide substrate 
comprising two auto-fluorescent proteins linked by the substrate amino-acid 
sequence. As a further alternative, after intracellular expression of the enzyme 
variants, or secretion into the periplasmic space using signal sequences such 
as DsbA, PhoA, PelB, OmpA, OmpT or gIII for Escherichia coli, a permeabilisation 
or lysis step releases the enzyme variants into the supernatant. The destruction 
of the membrane barrier can be forced by the use of mechanical means such as 
ultrasonication or a French press, or by the use of membrane-digesting enzymes such as 
lysozyme. As another, further alternative, the genes encoding the enzyme 
variants are expressed cell-free by the use of a suitable cell-free expression 
system. For example, the S30 extract from Escherichia coli cells is used for this 
purpose as described by Lesly et al. (Methods in Molecular Biology 37 (1995) 
265-278). 

The ensemble of gene variants generated and expressed by any of the above 
methods is analyzed with respect to affinity, substrate specificity or 
activity by appropriate assay and screening methods as described in detail for 
example in patent application PCT/EP03/04864. Genes from catalytically active 
variants having reduced specificity in comparison to the original enzyme are 
analyzed by sequencing. Sites at which mutations and/or insertions and/or 
deletions occurred are preferred insertion sites at which SDRs can be inserted 
site-specifically. 

In a second embodiment, the one or more fully or partially random peptide 
sequences are inserted at random sites in the protein scaffold. This modification 
is usually done on the polynucleotide level, i.e. by inserting nucleotide sequences 
into the gene that encodes the protein scaffold. Several methods are available 
that enable the random insertion of nucleotide sequences. Systems that can be 
used for random insertion are for example ligation based systems (Murakami et 
al. Nature Biotechnology 20 (2002) 76-81), systems based on DNA 
polymerisation and transposon based systems (e.g. GPS-M™ mutagenesis 
system, NEB Biolabs; MGS™ mutation generation system, Finnzymes). The 
transposon-based methods employ a transposase-mediated insertion of a 
selectable marker gene that contains at its termini recognition sequences for the 
transposase as well as two sites for a rare cutting restriction endonuclease. Using 
the latter endonuclease one usually releases the selection marker and after 
religation obtains an insertion. Instead of performing the religation one can 
alternatively insert a fragment that has terminal recognition sequences for one or 
two outside cutting restriction endonucleases as well as a selectable marker. After 
ligation, one releases this fragment using the one or two outside cutting 
endonucleases. After creating blunt ends by standard methods one inserts blunt 
ended random fragments at random positions into the gene. 

In a further preferred embodiment, methods for homologous in-vitro 
recombination are used to combine the mutations introduced by the above 
mentioned methods to generate enzyme populations. Examples of methods that 
can be applied are the Recombination Chain Reaction (RCR) according to patent 
application WO 0134835, the DNA-Shuffling method according to the patent 
application WO 9522625, the Staggered Extension method according to patent 
WO 9842728, or the Random Priming recombination according to patent 
application WO 9842728. Furthermore, also methods for non-homologous 
recombination such as the Itchy method can be applied (Ostermeier, M. et al. 
Nature Biotechnology 17 (1999) 1205-1209). 

Upon random insertion of a nucleotide sequence into the protein scaffold one 
obtains a library of different genes encoding enzyme variants. The polynucleotide 
library is subsequently transferred to an appropriate expression vector. Upon 
expression in a suitable host or by use of an in vitro expression system, a library 
of enzymes containing randomly inserted stretches of amino acids is obtained. 

According to step (b) of this third aspect of the invention, one or more fully or 
partially random peptide sequences are inserted into the protein scaffold. The 
actual number of such inserted SDRs is determined by the intended quantitative 
specificity following the relation: the higher the intended specificity is, the more 
SDRs are inserted. Whereas a single SDR enables the generation of moderately 
specific enzymes, two SDRs enable already the generation of significantly specific 
enzymes. However, up to six and more SDRs can be inserted into a protein 
scaffold. A similar relation is valid for the length of the SDRs: the higher the 
intended specificity is, the longer are the SDRs that are to be inserted. SDRs can 
be as short as one to four amino acid residues. They can, however, also be as 
long as 50 amino acid residues. Significant specificity can already be generated 
by the use of SDRs of a length of four to six amino acid residues. 

The peptide sequences that are inserted can be fully or partially random. In this
context, fully random means that a set of sequences is inserted in parallel that
includes sequences that differ from each other in each and every position.
Partially random means that a set of sequences is inserted in parallel that
includes sequences that differ from each other in at least one position. This
difference can be either pair-wise or with respect to a single sequence. For 
example, when regarding an insertion of the length of four amino acids, a partially
random set could be a set (i) that includes AGGG, GVGG, GGLG, GGGI, or (ii) that
includes AGGG, VGGG, LGGG and IGGG. Alternatively, random sequences also
comprise sequences that differ from each other in length. Randomization of the
peptide sequences is achieved by randomization of the nucleotide sequences that 
are inserted into the gene at the respective sites. Thereby, randomization can be 
achieved by employing mixtures of nucleobases as monomers during chemical 
synthesis of the oligonucleotides. A particularly preferred mixture of monomers 
for a fully random codon that in addition minimizes the probability of stop codons 
is NN(GTC). Alternatively, random oligonucleotides can be obtained by 
fragmentation of DNA into short fragments that are inserted into the gene at the 
respective sites. The source of the DNA to be fragmented may be a synthetic 
oligonucleotide but alternatively may originate from cloned genes, cDNAs, or 
genomic DNA. Preferably, the DNA is a gene encoding an enzyme. The
fragmentation can, for example, be achieved by random endonucleolytic
digestion of DNA. Preferably, an unspecific endonuclease such as DNase I (e.g.
from bovine pancreas) is employed for the endonucleolytic digestion. 
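
By way of illustration only, the effect of the NN(GTC) monomer mixture on stop codon frequency can be checked with a short script. The following Python sketch assumes the standard genetic code; the function names expand and summarize are chosen for this illustration and are not part of the disclosure.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def expand(first, second, third):
    """All codons obtainable from the base mixtures at codon positions 1 to 3."""
    return ["".join(c) for c in product(first, second, third)]


def summarize(codons):
    """Return (number of codons, number of stop codons, number of amino acids)."""
    encoded = [CODON_TABLE[c] for c in codons]
    stops = encoded.count("*")
    amino_acids = set(encoded) - {"*"}
    return len(codons), stops, len(amino_acids)


nnn = summarize(expand("ACGT", "ACGT", "ACGT"))  # unrestricted codon
nnb = summarize(expand("ACGT", "ACGT", "GTC"))   # NN(GTC) as named in the text

for name, (total, stops, n_aa) in (("NNN", nnn), ("NN(GTC)", nnb)):
    print(f"{name}: {total} codons, {stops} stop codon(s), {n_aa} amino acids")
```

Under these assumptions the restricted mixture still covers all twenty amino acids while leaving a single stop codon among its possible codons, compared with three for an unrestricted codon.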

If steps (a) - (c) of the inventive method are repeated cyclically, there are 
different alternatives for obtaining random peptide sequences that are inserted in 
consecutive rounds. Preferably, SDRs that were identified in one round as leading 
to increased specificity of the enzyme are used as templates for the random peptide
sequences that are inserted in the following round. 

In a preferred alternative, the sequences selected in one round are analysed and 
randomized oligonucleotides are generated based on these sequences. This can, 
for example, be achieved by using, in addition to the original nucleotide, a
certain percentage of a mixture of the other three nucleotide monomers at each
position in the oligonucleotide synthesis. If, for example, in a first round an SDR
is identified that has the amino acid sequence ARLT, e.g. encoded by the
nucleotide sequence GCG CGC CTT ACC, a random peptide sequence inserted in
this SDR site could be encoded by an oligonucleotide with 70% G, 10% A, 10% T 
and 10% C at the first position, 70% C, 10% G, 10% T and 10% A at the second 
position, etc. This leads at each position approximately in 1 of 3 cases to the 
template amino acid and in 2 of 3 cases to another amino acid. 
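
The arithmetic behind this estimate can be sketched as follows. The Python example below assumes the standard genetic code and the 70/10/10/10 monomer mixture from the example above; the helper names are hypothetical. The probability of retaining the exact template codon is 0.7 x 0.7 x 0.7, i.e. about 0.34, which corresponds to the "1 of 3" figure. When synonymous codons are counted as well, the share of codons encoding the template amino acid is somewhat higher for the four codons of this example.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def base_probs(template_base, keep=0.70):
    """Monomer mixture at one position: `keep` template base, rest split evenly."""
    other = (1.0 - keep) / 3
    return {b: (keep if b == template_base else other) for b in "ACGT"}


def p_same_amino_acid(template_codon, keep=0.70):
    """Probability that a doped codon still encodes the template amino acid."""
    target = CODON_TABLE[template_codon]
    mixes = [base_probs(b, keep) for b in template_codon]
    total = 0.0
    for b1, b2, b3 in product("ACGT", repeat=3):
        if CODON_TABLE[b1 + b2 + b3] == target:
            total += mixes[0][b1] * mixes[1][b2] * mixes[2][b3]
    return total


p_exact_codon = 0.70 ** 3  # all three template bases retained
print(f"exact template codon retained: {p_exact_codon:.3f}")
for codon in ("GCG", "CGC", "CTT", "ACC"):  # ARLT as encoded in the text
    print(codon, CODON_TABLE[codon], f"{p_same_amino_acid(codon):.3f}")
```

The `keep` parameter can be varied to estimate how strongly a given doping level biases the library towards the template sequence.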
In another preferred alternative, the sequences selected in one round are 
analyzed and a consensus library is generated based on these sequences. This 
can, for example, be achieved by using defined mixtures of nucleotides at each 
position in the oligonucleotide synthesis in a way that leads to mixtures of the 
amino acid residues that were identified at each position of the SDR selected in 
the previous round. If, for example, in a first round two SDRs are identified that 
have the amino acid sequences ARLT and VPGS, a consensus library inserted in 
this SDR site in the following round could be encoded by an oligonucleotide with 
the sequence G(C/T)G C(G/C)C (G/T)(G/T)G (A/T)CC. This would correspond to 
the random peptide sequence (A/V)(R/P)(L/G/V/W)(T/S), thereby allowing all 
combinations of the amino acid residues identified in the first round, and, due to 
the degeneracy of the genetic code, allowing in addition to a lower degree 
alternative amino acid residues at some positions. 
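
The consensus oligonucleotide of this example can be expanded and translated to confirm which residues it admits at each position. The following Python sketch assumes the standard genetic code; the names CONSENSUS_OLIGO, residues_at and is_encoded are chosen for illustration only.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# G(C/T)G C(G/C)C (G/T)(G/T)G (A/T)CC, one list of base mixtures per codon
CONSENSUS_OLIGO = [
    ["G", "CT", "G"],
    ["C", "GC", "C"],
    ["GT", "GT", "G"],
    ["AT", "C", "C"],
]


def residues_at(codon_mix):
    """Set of amino acids encoded by one degenerate codon."""
    return {CODON_TABLE["".join(c)] for c in product(*codon_mix)}


per_position = [residues_at(mix) for mix in CONSENSUS_OLIGO]

library_size = 1
for residues in per_position:
    library_size *= len(residues)


def is_encoded(peptide):
    """True if every residue of the peptide is allowed at its position."""
    return all(aa in allowed for aa, allowed in zip(peptide, per_position))


print([sorted(r) for r in per_position], library_size)
print(is_encoded("ARLT"), is_encoded("VPGS"))
```

In this sketch the third codon is the only position where the degeneracy of the genetic code introduces residues (V and W) that were not present in either selected SDR.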
In another preferred alternative, the sequences selected in one round are,
without previous analysis, recombined using methods for the in vitro 
recombination of polynucleotides, such as the methods described in WO 
01/34835 (the following also provides details of the eighth and ninth aspect of 
the invention). 

After insertion of the partially or fully random sequences into the gene encoding 
the scaffold protein, and, where applicable, ligation of the resulting gene into a suitable
expression vector using standard molecular cloning techniques (Sambrook, J.F.;
Fritsch, E.F.; Maniatis, T.; Cold Spring Harbor Laboratory Press, Second Edition,
1989, New York), the vector is introduced into a suitable expression host cell which
expresses the corresponding enzyme variant. Particularly suitable expression 
hosts are bacterial expression hosts such as Escherichia coli or Bacillus subtilis, 
or yeast expression hosts such as Saccharomyces cerevisiae or Pichia pastoris, or
mammalian expression hosts such as Chinese Hamster Ovary (CHO) or Baby
Hamster Kidney (BHK) cell lines, or viral expression systems such as
bacteriophages like M13, T7 phage or Lambda, or viruses such as the Baculovirus
expression system. As a further alternative, systems for in vitro protein 
expression can be used. Typically, the DNA is ligated into an expression vector 
behind a suitable signal sequence that leads to secretion of the enzyme variants 
into the extracellular space, thereby allowing direct detection of enzyme activity 
in the cell supernatant. Particularly suitable signal sequences for Escherichia coli
are ompA, pelB, HlyA, for Bacillus subtilis AprE, NprB, Mpr, AmyA, AmyE, Blac,
SacB, and for S. cerevisiae Bar1, Suc2, Mata, InulA, Ggplp. Alternatively, the
enzyme variants are expressed intracellularly and the substrates are also
expressed intracellularly. For protease variants, this is done essentially as
described in patent application WO 0212543, using a fusion peptide substrate
comprising two autofluorescent proteins linked by the substrate amino-acid
sequence. As a further alternative, after intracellular expression of the enzyme
variants, or secretion into the periplasmic space using signal sequences such
as DsbA, PhoA, PelB, OmpA, OmpT or gIII for Escherichia coli, a permeabilisation
or lysis step releases the enzyme variants into the supernatant. The destruction
of the membrane barrier can be forced by the use of mechanical means such as
ultrasound or a French press, or by the use of membrane-digesting enzymes such as
lysozyme. As another, further alternative, the genes encoding the enzyme
variants are expressed cell-free by the use of a suitable cell-free expression 
system. For example, the S30 extract from Escherichia coli cells is used for this 
purpose as described by Lesley et al. (Methods in Molecular Biology 37 (1995)
265-278). 

After introduction of the vector into host cells, these cells are screened for the 
expression of enzymes with specificity for the intended target substrate. Such 
screening is typically done by separating the cells from each other, in order to 
enable the correlation of genotype and phenotype, and assaying the activity of 
each cell clone after a growth and expression period. Such separation can for 
example be done by distribution of the cells into the compartments of sample 
carriers, e.g. as described in WO 01/24933. Alternatively, the cells are separated 
by streaking on agar plates, by enclosing in a polymer such as agarose, by filling 
into capillaries, or by similar methods. 

Identification of variants with the intended specificity can be done by different 
approaches. In the case of proteases, preferably assays using peptide substrates 
essentially as described in PCT/EP03/04864 are employed. 

Regardless of the expression format, selection of enzyme variants is done under
conditions that allow identification of enzymes that preferentially recognize and
convert the target sequence. As a first alternative, enzymes that preferentially
recognize and convert the target sequence are identified by screening for enzymes
with a high affinity for the target substrate sequence. High affinity corresponds
to a low KM, which is selected for by screening at target substrate concentrations
substantially below the KM of the first enzyme. Preferably, the substrates that are
used are linked to one or more fluorophores that enable the detection of the
modification of the substrate at concentrations below 10 µM, preferably below 1
µM, more preferably below 100 nM, and most preferably below 10 nM.
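
The rationale for screening at low substrate concentration can be illustrated numerically. The following Python sketch assumes simple Michaelis-Menten kinetics, which the text does not state explicitly, and uses two hypothetical variants with identical turnover number but different KM values; all numbers are placeholders chosen for illustration.

```python
def mm_rate(s, km, kcat=1.0, enzyme=1.0):
    """Michaelis-Menten rate v = kcat * [E] * [S] / (KM + [S])."""
    return kcat * enzyme * s / (km + s)


# Hypothetical variants (concentrations in µM); not values from the disclosure
KM_TIGHT = 5.0    # variant with high affinity for the target substrate
KM_WEAK = 500.0   # variant with low affinity for the target substrate


def advantage(s):
    """Rate of the high-affinity variant relative to the low-affinity one."""
    return mm_rate(s, KM_TIGHT) / mm_rate(s, KM_WEAK)


advantage_low_s = advantage(0.01)     # substrate far below both KM values
advantage_high_s = advantage(1000.0)  # substrate above both KM values

print(f"10 nM substrate: {advantage_low_s:.1f}-fold")
print(f"1 mM substrate:  {advantage_high_s:.2f}-fold")
```

In this model the high-affinity variant stands out by roughly the ratio of the two KM values when the substrate is scarce, whereas near saturation the two variants become hard to distinguish.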

As a second alternative, enzymes that preferentially recognize and convert the
target substrate are identified by employing two or more substrates in the assay and
screening for activity on these two or more substrates in comparison. Preferably,
the two or more substrates employed are linked to different marker molecules,
thereby enabling the detection of the modification of the two or more substrates
consecutively or in parallel. In the case of proteases, particularly preferably two
peptide substrates are employed, one peptide substrate having an arbitrarily 
chosen or even partially or fully random amino-acid sequence thereby enabling 
to monitor the activity on an arbitrary substrate, and the other peptide substrate 
having an amino-acid sequence identical to or resembling the intended target 
substrate sequence thereby enabling to monitor the activity on the target 
substrate. Especially preferably, these two peptide substrates are linked to 
fluorescent marker molecules, and the fluorescent properties of the two peptide 
substrates are sufficiently different in order to distinguish both activities when 
measured consecutively or in parallel. For example, a fusion protein comprising a 
first autofluorescent protein, a peptide, and a second autofluorescent protein 
according to patent application WO 0212543 can be used for this purpose. 
Alternatively, fluorophores such as rhodamines are linked chemically to the 
peptide substrates. 

As a third alternative, enzymes that preferentially recognize and convert the
target substrate are identified by employing one or more substrates resembling the
target substrate together with competing substrates in high excess. Screening
with respect to activity on the substrates resembling the target substrate is then
done in the presence of the competing substrates. Enzymes having a specificity
which corresponds qualitatively to the target specificity, but having only a low
quantitative specificity, are identified as negative samples in such a screen,
whereas enzymes having a specificity which corresponds qualitatively and
quantitatively to the target specificity are identified positively. Preferably, the
one or more substrates resembling the target substrate are linked to marker 
molecules, thereby enabling the detection of their modifications, whereas the 
competing substrates do not carry marker molecules. The competing substrates 
have arbitrarily chosen or random amino-acid sequences, thereby acting as 
competitive inhibitors for the hydrolysis of the marker-carrying substrates. For 
example, protein hydrolysates such as tryptone can serve as competing
substrates for engineered proteolytic enzymes according to the invention. 
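
The effect of unlabeled competing substrates can be sketched with the textbook rate law for competitive inhibition, in which the competitor raises the apparent KM for the labeled substrate. This model and all numerical values below are assumptions made for illustration; they are not taken from the disclosure.

```python
def rate(s, km, kcat=1.0, competitor=0.0, km_competitor=float("inf")):
    """Rate on the labeled substrate in the presence of a competing substrate.

    v = kcat * [S] / (KM * (1 + [C] / KM_C) + [S]), with [C] the competitor
    concentration and KM_C the enzyme's KM for the competitor.
    """
    return kcat * s / (km * (1.0 + competitor / km_competitor) + s)


def residual_activity(km_competitor, s=1.0, km=10.0, competitor=1e4):
    """Fraction of activity on the labeled substrate retained with competitor."""
    with_comp = rate(s, km, competitor=competitor, km_competitor=km_competitor)
    without = rate(s, km)
    return with_comp / without


# Hypothetical enzymes (µM): one accepts the competitor readily, one hardly
low_specificity = residual_activity(km_competitor=100.0)
high_specificity = residual_activity(km_competitor=1e5)

print(f"low quantitative specificity:  {low_specificity:.3f}")
print(f"high quantitative specificity: {high_specificity:.3f}")
```

Under these assumptions the enzyme that also accepts the competing substrate loses almost all of its signal and would score as a negative sample, while the enzyme that largely ignores the competitor keeps most of its activity.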
As a fourth alternative, enzymes that preferentially recognize and convert the
target substrate are identified and selected by an amplification-coupled or
growth-coupled selection step. Furthermore, the activity can be measured
intracellularly and the selection can be done by a cell sorter, such as a
fluorescence-activated cell sorter.

As a further alternative, enzymes that recognize and convert the target substrate 
are identified by first selecting enzymes that preferentially bind to the target 
substrate, and secondly selecting out of this subgroup of enzyme variants those 
enzymes that convert the target substrate. Selection for enzymes that 
preferentially bind the target substrate can be either done by selection of binders 
to the target substrate or by counter-selection of enzymes that bind to other 
substrates. Methods for the selection of binders or for the counter-selection of
non-binders are known in the art. Such methods typically require
phenotype-genotype coupling, which can be achieved by using surface display expression
methods. Such methods include, for example, phage or viral display, cell surface
display and in vitro display. Phage or viral display typically involves fusion of the 
protein of interest to a viral/phage protein. Cell surface display, i.e. either 
bacterial or eukaryotic cell display, typically involves fusion of the protein of 
interest to a peptide or protein that is located at the cell surface. In in-vitro 
display, the protein is typically made in vitro and linked directly or indirectly to 
the mRNA encoding the protein (DE 19646372). 

The invention also provides for a composition or pharmaceutical composition 
comprising one or more engineered enzymes according to the first aspect of the 
invention as defined herein before. The composition may optionally comprise an 
acceptable carrier, excipient and/or auxiliary agent. 

Pharmaceutical compositions according to the invention may optionally comprise
a pharmaceutically acceptable carrier. Pharmaceutical formulations are well
known and pharmaceutical compositions may be routinely formulated by one
having ordinary skill in the art. The composition can be formulated as a solution,
suspension, emulsion, or lyophilized powder in association with a
pharmaceutically acceptable vehicle. Examples of such vehicles are water, saline,
Ringer's solution, dextrose solution, and human serum albumin. Liposomes and
nonaqueous vehicles such as fixed oils may also be used. The vehicle or
lyophilized powder may contain additives that maintain isotonicity (e.g. sodium
chloride, mannitol) and chemical stability (e.g. buffers and preservatives). The
composition is sterilized by commonly used techniques.

The pharmaceutical composition of the present invention may be administered
by any means that enables the active agent to reach the agent's site of action in
the body of a mammal. Pharmaceutical compositions may be administered
parenterally, i.e. intravenously, subcutaneously or intramuscularly.

Dosage varies depending upon known factors such as the pharmacodynamic
characteristics of the particular agent, and its mode and route of administration;
age, health, and weight of the recipient; nature and extent of symptoms, kind of 
concurrent treatment, frequency of treatment, and the effect desired. 

Non-pharmaceutical compositions as defined herein are research compositions,
nutritional compositions, cleaning compositions, disinfection compositions, cosmetic
compositions or compositions for personal care. Moreover, DNA sequences coding
for the engineered enzyme as defined herein before and vectors containing said 
DNA sequences are also provided. Finally, transformed host cells (prokaryotic or 
eukaryotic) or transgenic organisms containing such DNA sequences and/or 
vectors, as well as a method utilizing such host cells or transgenic animals for 
producing the engineered enzyme of the first aspect of the invention are also 
contemplated. 

Detailed description of the figures 

Figure 1: Three-dimensional structure of human trypsin I with the active site
residues shown in "ball-and-stick" representation and with the marked regions 
indicating potential SDR insertion sites. 

Figure 2: Alignment of the primary amino acid sequences of the human 
proteases trypsin I, alpha-thrombin and enteropeptidase all of which belong to 
the structural class S1 of the serine protease family. Trypsin represents an 
unspecific protease of this structural class, while alpha-thrombin and 
enteropeptidase are proteases with high substrate specificity. Compared to 
trypsin, several regions of insertions of three or more amino acids into the
primary sequence of alpha-thrombin and enteropeptidase are seen. The region marked
with (-1-) and the region marked with (-3-) are preferred SDR insertion sites. In
the tertiary structure of alpha-thrombin both regions are in the vicinity of the
substrate binding site. These regions therefore fulfill two criteria to be selected
as candidates for SDRs: firstly, they represent insertions in the specific proteases 
compared to the unspecific one and, secondly, they are close to the substrate 
binding site. A representation of the three-dimensional structure is given in 
figure 3. 

Figure 3: Three-dimensional structure of subtilisin with the active site residues
being shown in "ball-and-stick" representation and with the numbered regions 
indicating potential SDR insertion sites. 

Figure 4: Alignment of the primary amino acid sequences of subtilisin E, furin,
PC1 and PC5, all of which belong to the structural class S8 of the serine protease
family. Subtilisin E represents an unspecific protease of this structural class,
while furin, PC1 and PC5 are proteases with high substrate specificity. Compared
to subtilisin, several regions of insertions of three or more amino acids into the
primary sequence of furin, PC1 and PC5 are seen. The regions marked with (-4-),
(-5-), (-7-), (-9-) and (-11-) are preferred SDR insertion sites. These regions
fulfill two criteria to be selected as candidates for SDRs: firstly, they
represent insertions in the specific proteases compared to the unspecific one 
and, secondly, they are close to the active site residues. 

Figure 5: Three-dimensional structure of beta-secretase with the active site 
residues being shown in "ball-and-stick" representation and with the numbered 
regions indicating potential SDR insertion sites. 

Figure 6: Alignment of the primary amino acid sequences of pepsin, beta-secretase
and cathepsin D, all of which belong to the structural class A1 of the aspartic
protease family. Pepsin represents an unspecific protease of this structural class,
while beta-secretase and cathepsin D are proteases with high substrate specificity.
Compared to pepsin, several regions of insertions of three or more amino acids
into the primary sequence of beta-secretase and cathepsin D are seen. The regions
marked with -1- to -11- correspond to possible SDR combining sites and are
also marked in Fig. 5.



WO 2004/113522 



PCT/EP2004/051173 




Figure 7: Three-dimensional structure of caspase 7 with the active
site residues being shown in "ball-and-stick" representation and with the
numbered regions indicating potential SDR insertion sites.

Figure 8: Primary amino acid sequence of caspase 7 as a member of
the cysteine protease class C14 family (see also SEQ ID NO: 14).

Figure 9: Schematic representation of the method according to the third aspect of
the invention.

Figure 10: Western blot analysis of trypsin expression. Supernatants of cell
cultures expressing variants of trypsin are compared to negative controls. Lane
1: molecular weight standard; lane 2: negative control; lane 3: supernatant of 
variant a; lane 4: negative control; lane 5: supernatant of variant b. A primary 
antibody specific to the expressed protein and a secondary antibody for 
generation of the signal were used. 

Figure 11: Time course of the proteolytic cleavage of a target substrate. 
Supernatants of cells containing the vector with the gene for human trypsin and
of cells containing the vector without the gene were incubated with the
peptide substrate described in the text. Cleavage of the peptide results in a 
decreased read out value. Proteolytic activity is confirmed for the positive clone. 

Figure 12: Relative activity of three engineered proteolytic enzymes in
comparison with human trypsin I on two different peptide substrates. A time
course of the proteolytic digestion of the two substrates was performed and
evaluated. Substrate B was used for screening and substrate A is a closely
related sequence. Relative activity of the three variants was normalized to the
activity of human trypsin I. Variants 1 and 2 clearly show increased specificity
towards the target substrate. Variant 3, on the other hand, serves as a negative
control with similar activities as the human trypsin I.

Figure 13: Relative specificities of trypsin and variants of engineered proteolytic 
enzymes with one or two SDRs, respectively. Activity of the proteases was 
determined in the presence and absence of competitor substrate, i.e. peptone at 
a concentration of 10 mg/ml. Time courses for the proteolytic cleavage were
recorded and the time constants k determined. The ratios between the time
constants with and without competitor were formed and represent a quantitative 
measure for the specificity of the protease. The ratios were normalized to 
trypsin. The specificity of the variant containing two SDRs is 2.5 fold higher than 
that of the variant with SDR2 alone. 
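
The calculation described for Figure 13 can be sketched as follows. The Python example below assumes that k behaves like a rate constant, so that a larger ratio of k with competitor to k without competitor indicates higher specificity; this reading and all numerical values are hypothetical placeholders chosen to illustrate the normalization, not measured data.

```python
# (k without competitor, k with competitor) per protease; made-up numbers
time_constants = {
    "trypsin": (1.0, 0.1),
    "variant_SDR2": (1.0, 0.2),
    "variant_SDR1_SDR2": (0.8, 0.4),
}


def specificity_ratio(k_without, k_with):
    """Ratio of the constant with competitor to the constant without."""
    return k_with / k_without


raw = {name: specificity_ratio(*ks) for name, ks in time_constants.items()}

# Normalize so that the unmodified scaffold (trypsin) has specificity 1
relative = {name: value / raw["trypsin"] for name, value in raw.items()}

for name, value in relative.items():
    print(f"{name}: {value:.2f}")
```

Normalizing to the unmodified scaffold makes variants from different experiments comparable, provided the same competitor concentration is used throughout.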

Figure 14: Relative specificities of protease variants in the absence and
presence of competitor substrate. The protease variants containing two inserts
with different sequences and the non-modified scaffold human trypsin I were 
expressed in a suitable host. Activity of the protease variants was determined as 
the cleavage rate of a peptide with the desired target sequence of TNF-alpha in 
the absence and presence of competitor substrate. Specificity is expressed as the 
ratio of cleavage rates in the presence and absence of competitor. 

Figure 15: The figure shows the reduction of cytotoxicity induced by human
TNF-alpha when incubating the human TNF-alpha with concentrated supernatant from
cultures expressing the inventive engineered proteolytic enzymes being specific 
for human TNF-alpha. This indicates the efficacy of the inventive engineered 
proteolytic enzymes. 

Figure 16: The figure shows the reduction of cytotoxicity induced by human
TNF-alpha when incubating the human TNF-alpha with different concentrations of
purified inventive engineered proteolytic enzyme being specific for human
TNF-alpha. Variant g comprises SEQ ID NO:72 as SDR1 and SEQ ID NO:73 as SDR2.
This indicates the efficacy of the inventive engineered proteolytic enzymes. 

Figure 17: The figure compares the activity of inventive engineered proteolytic
enzymes being specific for human TNF-alpha with the activity of human trypsin I
on two protein substrates: (a) human TNF-alpha; (b) mixture of human serum
proteins. This indicates the safety of the inventive engineered proteolytic
enzymes. Variant x corresponds to SEQ ID NO:75 comprising the SDRs according
to SEQ ID NO:89 (SDR1) and SEQ ID NO:95 (SDR2). Variants xi and xii correspond to
derivatives thereof comprising the same SDR sequences.

Figure 18: Specific hydrolysis of human VEGF by an engineered proteolytic 
enzyme derived from human trypsin.

Examples

In the following examples, materials and methods of the present invention are 
provided including the determination of catalytic properties of enzymes obtained 
by the method. It should be understood that these examples are for illustrative 
purpose only and are not to be construed as limiting this invention in any 
manner. All publications, patents, and patent applications cited herein are hereby 
incorporated by reference in their entirety for all purposes. 

In the experimental examples described below, standard techniques of 
recombinant DNA technology were used that were described in various 
publications, e.g. Sambrook et al. (1989), Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, or Ausubel et al. (1987), Current 
Protocols in Molecular Biology 1987-1988, Wiley Interscience. Unless otherwise 
indicated, restriction enzymes, polymerases and other enzymes as well as DNA 
purification kits were used according to the manufacturers' specifications.

Example I: Identification of SDR sites in human trypsin

Insertion sites for SDRs have been identified in the serine protease human 
trypsin I (structural class S1) by comparison with members of the same 
structural class having a higher sequence specificity. Trypsin represents a
member with low substrate specificity, as it requires only an arginine or lysine
residue at the P1 position. On the other hand, thrombin, tissue-type plasminogen
activator and enterokinase all have a high specificity towards their substrate
sequences, i.e. (L/I/V/F)XPR^NA, CPGR^VVGG and DDDK^, respectively (^ marks the
cleaved bond). The
primary sequences and tertiary structures of these and further S1 serine
proteases have been aligned in order to determine regions of low and high
sequence and structure homology and especially regions that correspond to
insertions in the sequences of the more specific proteases (Figure 2). Several
regions of insertions equal to or longer than 3 amino acids representing potential
SDR sites have been identified as indicated in Figure 1. These regions were
chosen as target sites for the insertion of SDRs in the examples below, e.g. SDR1 
(region one in figure 2, after amino acid 42 according to SEQ ID NO:1) with a 
length of six and SDR2 (region three in figure 2, after amino acid 123 according 
to SEQ ID NO:1) with a length of five amino acids, respectively. 
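
The search for insertion regions in a pairwise alignment can be sketched as follows. The Python example below is a minimal illustration that assumes an alignment is already available as two gapped strings of equal length; the function name and the toy sequences are hypothetical and do not correspond to the protease sequences of Figure 2. The structural criterion (proximity to the substrate binding site) is not covered by this sketch.

```python
def insertion_regions(unspecific_aln, specific_aln, min_len=3):
    """Find runs where the specific protease has residues and the unspecific one gaps.

    Returns a list of (residues_before, inserted_sequence), where residues_before
    is the number of residues of the unspecific protease preceding the insertion.
    """
    if len(unspecific_aln) != len(specific_aln):
        raise ValueError("aligned sequences must have equal length")

    regions = []
    run_start = None
    # A sentinel column closes a run that extends to the end of the alignment
    columns = list(zip(unspecific_aln, specific_aln)) + [("X", "X")]
    for i, (u, s) in enumerate(columns):
        in_insertion = u == "-" and s != "-"
        if in_insertion and run_start is None:
            run_start = i
        elif not in_insertion and run_start is not None:
            if i - run_start >= min_len:
                before = len(unspecific_aln[:run_start].replace("-", ""))
                regions.append((before, specific_aln[run_start:i]))
            run_start = None
    return regions


# Toy alignment, for illustration only
unspecific = "ACDE----FGHIK--LMN"
specific = "ACDEWWYPFGH-KQRLMN"

print(insertion_regions(unspecific, specific))             # insertions of >= 3 residues
print(insertion_regions(unspecific, specific, min_len=2))  # relaxed threshold
```

Reporting the position as the number of preceding scaffold residues mirrors the way insertion sites are referred to in the text (e.g. "after amino acid 42").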

Example II: Molecular cloning of the human trypsin I gene to be used as scaffold
protein and expression of the mature protease in B. subtilis

The gene encoding the unspecific protease human trypsinogen I was cloned into 
the vector pUC18. Cloning was done as follows: the coding sequence of the 
protein was amplified by PCR using primers that introduced a KpnI site at the 5'
end and a BamHI site at the 3' end. This PCR fragment was cloned into the
appropriate sites of the vector pUC18. Identity was confirmed by sequencing.
After sequencing, the coding sequence of the mature protein was amplified by
PCR using primers that introduced different BglI sites at the 5' end and the 3'
end.

This PCR fragment was cloned into the appropriate sites of an E. coli - B. subtilis 
shuttle vector. The vector contains a pMB1 origin for amplification in E. coli, a
neomycin resistance marker for selection in E. coli, as well as a P43 promoter for
the constitutive expression in B. subtilis. An 87 bp fragment that contains the
leader sequence encoding the signal peptide from the sacB gene of B. subtilis
was introduced behind the P43 promoter. Different BglI restriction sites serve as
insertion sites for heterologous genes to be expressed. 

Expression of human trypsin I was confirmed by measurement of the proteolytic 
activity in supernatant of cells containing the vector with the gene in comparison
to a negative control. A peptide including an arginine cleavage site was chosen
as a substrate. The peptide was N-terminally biotinylated and labeled with a
fluorophore at the C-terminus. After incubation of the peptide with culture
supernatant, streptavidin was added. Uncleaved peptide associates with
streptavidin and leads to a high read out value, while cleavage results in low read
out values. Figure 11 shows the time course of a proteolytic digestion by
supernatant of B. subtilis cells containing the vector with the trypsin I gene in
comparison to supernatant of B. subtilis cells containing the vector without the
trypsin I gene (negative control).
As a further confirmation of expression of the protease, supernatants of cells 
containing the vector with the gene and control cells were analyzed by 
polyacrylamide gel electrophoresis and subsequent western blot using an
antibody specific to the target protease. The procedure was performed according
to standard methods (Sambrook, J.F.; Fritsch, E.F.; Maniatis, T.; Cold Spring
Harbor Laboratory Press, Second Edition, 1989, New York). Figure 10 confirms
expression of the protein only in the cells harbouring the vector with the gene for 
trypsin. 
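
A decreasing read out of the kind described above can be reduced to a single rate constant. The following Python sketch assumes that the read out decays mono-exponentially towards a known plateau (the value for fully cleaved peptide); this model, the function name and the synthetic data are assumptions made for illustration and are not part of the described procedure.

```python
import math


def fit_rate_constant(times, readouts, plateau):
    """Estimate k for R(t) = plateau + (R0 - plateau) * exp(-k * t).

    Uses a least-squares line through ln(R - plateau) versus t; the slope is -k.
    """
    ys = [math.log(r - plateau) for r in readouts]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(ys) / n
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, ys))
    sxx = sum((t - mean_t) ** 2 for t in times)
    return -sxy / sxx


# Synthetic, noise-free time course generated from a known constant
TRUE_K = 0.05      # per minute, hypothetical
R0 = 1000.0        # read out for uncleaved peptide, hypothetical
PLATEAU = 100.0    # read out for fully cleaved peptide, hypothetical
times = [0, 10, 20, 30, 40, 60]
readouts = [PLATEAU + (R0 - PLATEAU) * math.exp(-TRUE_K * t) for t in times]

k_est = fit_rate_constant(times, readouts, PLATEAU)
print(f"estimated k = {k_est:.4f} per minute")
```

With real measurements the plateau would have to be determined experimentally, and a non-linear fit may be preferable when the data are noisy near the plateau.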

Example III: Providing a scaffold protein

In this example, human trypsin I was used as the scaffold protein. The gene was 
either used in its natural form, or, alternatively, was modified to result in a 
scaffold protein with increased catalytic activity or further improved 
characteristics. 

The modification was done by random modification of the gene, followed by 
expression of the enzyme and subsequent selection for increased activity. First, 
the gene was PCR amplified under error-prone conditions, essentially as 
described by Cadwell, R.C and Joyce, G.F. (PCR Methods Appl. 2 (1992) 28-33). 
Error-prone PCR was done using 30 pmol of each primer, 20 nmol dGTP and 
dATP, 100 nmol dCTP and dTTP, 20 fmol template, and 5 U Taq DNA polymerase 
in 10 mM Tris-HCl pH 7.6, 50 mM KCl, 7 mM MgCl2, 0.5 mM MnCl2, 0.01 %
gelatin for 20 cycles of 1 min at 94 °C, 1 min at 65 °C and 1 min at 72 °C. The
resulting DNA library was purified using the QIAquick PCR Purification Kit
following the suppliers' instructions. The PCR product was digested with the
restriction enzyme BglI and purified. Afterwards, the PCR product was ligated
into the E. coli - B. subtilis shuttle vector described above which was digested
with BglI and dephosphorylated. The ligation products were transformed into E.
coli, amplified in LB, and the plasmids were purified using the Qiagen Plasmid
Purification Kit following the suppliers' instructions. Resulting plasmids were
transformed into B. subtilis cells.

Alternatively, or in addition to random mutagenesis, variants of the gene were 
statistically recombined at homologous positions by use of the Recombination 
Chain Reaction, essentially as described in WO 0134835. PCR products of the 
genes encoding the protease variants were purified using the QIAquick PCR 
Purification Kit following the suppliers' instructions, checked for correct size by 
agarose gel electrophoresis and mixed together in equimolar amounts. 80 ng of 
this PCR mix in 150 mM Tris-HCl pH 7.6, 6.6 mM MgCl2 were heated for 5 min at
94 °C and subsequently cooled down to 37 °C at 0.05 °C/s in order to re-anneal
strands and thereby produce heteroduplexes in a stochastic manner. Then, 2.5 U
Exonuclease III per µg DNA were added and incubated for 20, 40 or 60 min at 37
°C in order to digest different lengths from both 3' ends of the heteroduplexes.
The partly digested PCR products were refilled with 0.6 U Pfu polymerase per µg
DNA by incubating for 15 min at 72 °C in 0.17 mM dNTPs and Pfu polymerase
buffer according to the suppliers' instructions. After performing a single PCR
cycle, the resulting DNA was purified using the QIAquick PCR Purification Kit
following the suppliers' instructions, digested with BglI and ligated into the
linearized vector. The ligation products were transformed into E. coli, amplified in
LB containing ampicillin as marker, and the plasmids were purified using the
Qiagen Plasmid Purification Kit following the suppliers' instructions. Resulting
plasmids were transformed into B. subtilis cells.

Example IV: Insertion of SDRs into the protein scaffold of human trypsin I and
generation of an engineered proteolytic enzyme with specificity for a peptide
substrate having the sequence KKWLGRVPGGPV.

In order to create insertion sites for SDRs in human trypsin I, two pairs of 
different restriction sites were introduced into the gene at sites that were 
identified as potential SDR sites (see Example I above) without changing the 
amino acid sequence. The insertion of the restriction sites was done by overlap 
extension PCR. Primers restr1 and restr2 were used for the introduction of SacII
and BamHI restriction sites, restr3 and restr4 were used for the introduction of
KpnI and NheI restriction sites. The sequences of the primers were as follows:

Binding site for restr1 and restr2 and the corresponding amino acid sequence
(SEQ ID NO: 54):

5'-GGTGGTAT CAGCAG GCCACTGCTACAAGTCCC GCATCC AGGT-3'
V V S A G H C Y K S R I Q

Forward primer restr1 (SEQ ID NO: 56):

5'-GGTGGTAT CCGCGG GCCACTGCTACAAGTCCC GGATCC AGGT-3'

Reverse primer restr2 (SEQ ID NO: 57):

5'-ACCT GGATCC GGGACTTGTAGCAGTGGC CCGCGG ATACCACC-3'

Binding site for restr3 and restr4 and the corresponding amino acid sequence
(SEQ ID NO: 58):

5'-CCACT GGCACGA AGTGCCTCATCTCTGGCTGGGGCAACACT GCGAGC TCT-3'
T G T K C L I S G W G N T A S S

Forward primer restr3 (SEQ ID NO: 60):

5'-CCACT GGCACGA AGTGCCTCATCTCTGGCTGGGGCAACACT GCGAGC TCT-3'

Reverse primer restr4 (SEQ ID NO: 61):

5'-AGAGCTAGCAGTGTTGCCCCAGCCAGAGATGAGGCACTTGGTACCAGTGG-3'
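
The design of the restr1/restr2 primer pair can be checked computationally. The following Python sketch, given as an illustration only, verifies three properties of the sequences listed above: restr2 is the reverse complement of restr1, restr1 carries the SacII (CCGCGG) and BamHI (GGATCC) recognition sites that are absent from the original binding site, and the encoded amino acid sequence is unchanged. The helper names are hypothetical; the codon table is the standard genetic code.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in T, C, A, G order
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq, offset):
    """Translate complete codons of seq, starting at the given 0-based offset."""
    return "".join(CODON_TABLE[seq[i:i + 3]]
                   for i in range(offset, len(seq) - 2, 3))


binding_site = "GGTGGTATCAGCAGGCCACTGCTACAAGTCCCGCATCCAGGT"
restr1 = "GGTGGTATCCGCGGGCCACTGCTACAAGTCCCGGATCCAGGT"
restr2 = "ACCTGGATCCGGGACTTGTAGCAGTGGCCCGCGGATACCACC"
SITES = {"SacII": "CCGCGG", "BamHI": "GGATCC"}

primers_complementary = reverse_complement(restr1) == restr2
sites_introduced = all(site in restr1 and site not in binding_site
                       for site in SITES.values())
# The reading frame starts at the second nucleotide of the binding site
silent = translate(binding_site, 1) == translate(restr1, 1)

print("restr2 is reverse complement of restr1:", primers_complementary)
print("SacII and BamHI sites newly introduced:", sites_introduced)
print("encoded peptide:", translate(restr1, 1), "| unchanged:", silent)
```

The same functions can be applied to restr4, whose sequence contains the KpnI (GGTACC) and NheI (GCTAGC) recognition sites.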



In a first overlap extension PCR, the SacII/BamHI sites were introduced,
enabling the insertion of SDR1, and in a second overlap extension PCR the
KpnI/NheI sites, enabling the insertion of SDR2. The product of the overlap
extension PCR was amplified using primers pUC-forward and pUC-reverse. The
sequences of pUC-forward and pUC-reverse are as follows:

pUC-forward (SEQ ID NO: 62): 5'-GG GGTACC CCACCACCATGAATCCACTCCT-3'
pUC-reverse (SEQ ID NO: 63): 5'-CGGGATCCGGTATAGAGACTGAAGAGATAC-3'

The restriction sites generated thereby were subsequently used to insert defined 
or random oligonucleotides into the SDR1 and SDR2 insertion sites by standard 
restriction and ligation methods. Typically, two complementary synthetic 5'- 
phosphorylated oligonucleotides were annealed and ligated into a vector carrying 
the modified human trypsin I gene that was cleaved with the respective 
restriction enzymes. Oligonucleotides encoding SDR1 were inserted via the
SacII/BamHI sites whereas oligonucleotides encoding SDR2 were inserted via the
KpnI/NheI sites. For each insertion an oligonucleotide pair according to the
following general sequences was used ([P] indicating 5'-phosphorylation, N and X
indicating any nucleotide or amino acid residue, respectively):
oligox-SDR1f (SEQ ID NO:64): 

5 1 - [P ] -GGGCCACTGCTAC NNNISTO 1 

oligox-SDR1r (SEQ ID NO:66): 

3 ' -CGCCCGGTGACGAT GNNNNNN^ [P ] -5 r 

G H C Y XXX XXX K S 



oligox-SDR2f (SEQ ID NO:67):

5 1 - [P ] — CAAGTGCCTCATCTCTGGCTGGGGCAAC NNl^ f 

oligox-SDR2r (SEQ ID NO:69): 

3 ' — CATGGTTCACGGAGTAGAGACCGACCCCGTT GNISrN^ -5 1 

KCLISGWGN XXX X X T

As an alternative to the above method, a PCR-based method was used for the
integration of random sequences into the SDR1 and SDR2 insertion sites in the
modified human trypsin I. For each SDR, one primer was used in which the SDR
region is fully randomized. Sequences of the primers were as follows (N =
A/C/G/T, B = C/G/T, V = A/C/G):
Primer SDR1-mutnnb-forward (SEQ ID NO:70):

5'-TGGTATCCGCGGGCCACTGCTACNNBNNBNNBNNBNNBNNBAAGTCCCGGATCCAGGTG-3'

Primer SDR2-mutnnb-reverse (SEQ ID NO:71):

5 1 -GGCGCCAGAGCTAGCAGTVNIS^ » 

The codon NNB, or VNN in the reverse strand, allows all 20 amino acids to be
encoded, but reduces the probability of encoding a stop codon from 0.047 (3 of
64 codons for NNN) to 0.021 (1 of 48 codons for NNB).
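
The stop codon frequencies stated above can be reproduced by enumerating the degenerate codons. The following Python sketch assumes the standard genetic code and the IUPAC ambiguity codes given in the text; the function names are illustrative.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in T, C, A, G order
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "N": "ACGT", "B": "CGT", "V": "ACG"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def expand(degenerate_codon):
    """All concrete codons matched by a degenerate codon such as 'NNB'."""
    return ["".join(p)
            for p in product(*(IUPAC[ch] for ch in degenerate_codon))]


def stop_fraction(degenerate_codon):
    """Fraction of matched codons that are stop codons."""
    codons = expand(degenerate_codon)
    return sum(CODON_TABLE[c] == "*" for c in codons) / len(codons)


def amino_acids(degenerate_codon):
    """Set of amino acids (one-letter code) encoded by the matched codons."""
    return {CODON_TABLE[c] for c in expand(degenerate_codon)} - {"*"}


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


for scheme in ("NNN", "NNB"):
    print(scheme, f"stop fraction = {stop_fraction(scheme):.3f},",
          f"amino acids = {len(amino_acids(scheme))}")

# VNN on the reverse strand reads as NNB on the coding strand
vnn_matches_nnb = ({reverse_complement(c) for c in expand("VNN")}
                   == set(expand("NNB")))
print("VNN (reverse strand) equals NNB (coding strand):", vnn_matches_nnb)
```

Of the three stop codons (TAA, TAG, TGA), only TAG ends in a base permitted by B, which is why the stop frequency drops while all 20 amino acids remain accessible.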

As a further alternative, after identification of SDRs that lead to increased 
specificity, these SDRs were used as templates for further randomization. 
Thereby, random peptide sequences were inserted that were partially 
randomized at each position and partially identical at each position to the original 
sequence. 

As an example, random peptide sequences that have in approximately 1 of 3 
cases the template amino acid residue and in approximately 2 of 3 cases any 
other amino acid residue at each position were inserted into the two SDR 
insertion sites of the modified human trypsin I. For this purpose, primers that
contain at each nucleotide position of the SDR approximately 70% of the
template bases and 30% of a mixture of the three other bases were used.
With each primer pair a PCR was performed under standard conditions using the
human trypsin I gene as template. The resulting DNA was purified using the
QIAquick PCR Purification Kit following the suppliers' instructions and digested
with SacII and NheI. After digestion the DNA was purified and ligated into the
SacII and NheI digested and dephosphorylated vector. The ligation products
were transformed into E. coli, amplified in LB containing the respective marker,
and the plasmids were purified using the Qiagen Plasmid Purification Kit following
the suppliers' instructions. Resulting plasmids were transformed into B. subtilis
cells. These cells were then separated to single cells, grown to clones, and after 
expression of the protease gene screened for proteolytic activity. 
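
The relation between the 70%/30% nucleotide doping and the stated retention of the template residue in roughly 1 of 3 cases can be illustrated as follows. This Python sketch assumes that each of the three non-template bases occurs with equal probability (10% each) and that positions mutate independently; both are assumptions for illustration, and the function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in T, C, A, G order
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def same_residue_probability(template_codon, keep=0.7):
    """Probability that a doped codon still encodes the template residue."""
    other = (1.0 - keep) / 3.0          # each non-template base, assumed equal
    target = CODON_TABLE[template_codon]
    total = 0.0
    for bases in product(BASES, repeat=3):
        mutant = "".join(bases)
        if CODON_TABLE[mutant] != target:
            continue
        p = 1.0
        for template_base, new_base in zip(template_codon, mutant):
            p *= keep if template_base == new_base else other
        total += p
    return total


for codon in ("ATG", "TGG", "GGC", "CTG"):
    print(codon, CODON_TABLE[codon],
          f"{same_residue_probability(codon):.3f}")
```

For residues with a single codon (Met, Trp) the probability is 0.7 x 0.7 x 0.7 = 0.343, i.e. approximately 1 in 3. For residues with synonymous codons the value is somewhat higher, so 1 in 3 is best read as an approximate lower bound under these assumptions.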

The following substrates were employed for screening for proteolytic activity
(SEQ ID NOs:76 and 77):

Protease variants were screened on substrate B at complexities of 10^6 variants
by confocal fluorescence spectroscopy. The substrate was a peptide biotinylated
at the N-terminus and fluorescently labeled at the C-terminus. After incubation of
the peptide with supernatant of cells expressing different variants of the
protease, streptavidin is added and the samples are analysed by confocal
fluorimetry. The low concentration of the peptide (20 nM) leads to a preferential
cleavage by proteases with a high kcat/KM value, i.e. proteases with high
specificity towards the target sequence.
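
The link between a low substrate concentration and kcat/KM follows from standard Michaelis-Menten kinetics: when the substrate concentration is far below KM, the rate reduces to approximately (kcat/KM)[E][S], so the intact substrate decays exponentially with an observed rate constant k_obs = (kcat/KM)[E]. The following Python sketch estimates k_obs from a time course by a log-linear least-squares fit. The time points, the rate constant and the enzyme concentration are hypothetical values chosen for illustration and are not measurements from the experiments described here.

```python
import math


def fit_rate_constant(times_s, fraction_intact):
    """Slope of ln(fraction intact) versus time; returns k_obs in 1/s."""
    ys = [math.log(f) for f in fraction_intact]
    n = len(times_s)
    mean_t = sum(times_s) / n
    mean_y = sum(ys) / n
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_s, ys))
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    return -sxy / sxx


# Hypothetical, noise-free time course generated from a known rate constant
K_TRUE_PER_S = 2.0e-3
ENZYME_MOLAR = 1.0e-8
times = [0.0, 60.0, 120.0, 300.0, 600.0]
fractions = [math.exp(-K_TRUE_PER_S * t) for t in times]

k_obs = fit_rate_constant(times, fractions)
kcat_over_km = k_obs / ENZYME_MOLAR   # valid only while [S] << KM

print(f"k_obs = {k_obs:.4f} 1/s, kcat/KM = {kcat_over_km:.2e} 1/(M s)")
```

Because k_obs scales with kcat/KM at fixed enzyme concentration, comparing k_obs between two substrates gives their relative kcat/KM values without determining kcat and KM separately.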

Variants selected in the screening procedure were further evaluated for their
specificity towards substrate B and the closely related substrate A by measuring
time courses of the proteolytic digestion and determining the rate constants,
which are proportional to the kcat/KM values. Compared to the human trypsin I
that was used as scaffold protein, the specific activity of variants 1 and 2
(SEQ ID NOs: 2 and 3, respectively) is clearly shifted towards substrate B.
Variant 3 (SEQ ID NO:4), on the other hand, serves as a negative control with
activities similar to those of human trypsin I. Sequencing of the genes of the
three variants revealed the following amino acid sequences in the SDRs.

Table 2: Sequences of the two SDRs in three different variants selected for
specific hydrolysis of substrate B (SEQ ID NOs: 78-83).

In a further experiment a pool of variants containing different numbers of SDRs
per gene was screened for increased specificity using a mixture of the defined
substrate and peptone as a competing substrate. Variants containing one or two
SDRs per gene have been analyzed further. As a measure for the specificity the
activity in the peptide cleavage assay was compared with and without the
presence of the competing substrate. The concentration of the competing
substrate was 10 mg/ml. Under these conditions, unspecific proteases show,
compared to specific proteases, a stronger decrease in activity with increasing
competitor concentrations (range between 0 and 100 mg/ml). The ratio of
proteolytic activity with and without competing substrate is a quantitative
measure for the specificity of the proteases. Figure 9 shows the relative
activities with and without competing substrate. Human trypsin I that was used
as the scaffold protein and two variants, one containing only SDR2, and one
containing both SDRs, were compared. The specificity of the variant with both
SDRs is higher by a factor of 2.5 than that of the variant with SDR2 only,
confirming that there is a direct relation between the number of SDRs and the
quantitative specificity of resulting engineered proteolytic enzymes.
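
The behaviour of the specificity measure can be sketched with a simple competitive model, in which the competing substrate occupies the enzyme with an apparent constant K_comp, so that the remaining activity on the target peptide is 1 / (1 + [competitor] / K_comp). This model and the K_comp values below are assumptions made for illustration only; they are not fitted to the data of Figure 9.

```python
def relative_activity(competitor_mg_ml, k_comp_mg_ml):
    """Activity with competitor divided by activity without competitor."""
    return 1.0 / (1.0 + competitor_mg_ml / k_comp_mg_ml)


# Hypothetical apparent constants: a low K_comp means the competitor binds well
K_COMP_UNSPECIFIC = 5.0    # mg/ml, illustrative value
K_COMP_SPECIFIC = 50.0     # mg/ml, illustrative value

for conc in (0.0, 10.0, 100.0):
    unspecific = relative_activity(conc, K_COMP_UNSPECIFIC)
    specific = relative_activity(conc, K_COMP_SPECIFIC)
    print(f"{conc:6.1f} mg/ml competitor: "
          f"unspecific {unspecific:.2f}, specific {specific:.2f}")
```

In this sketch the unspecific protease loses activity faster with increasing competitor concentration, which is the qualitative behaviour described above; the ratio at a fixed competitor concentration serves as the specificity measure.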

Example V: Generation of an engineered proteolytic enzyme that specifically
inactivates human TNF-alpha

Human trypsin alpha I or a derivative comprising one or more of the following 
amino acid substitutions E56G; R78W; Y131F; A146T; C183R was used as 
protein scaffold for the generation of an engineered proteolytic enzyme with high 
specificity towards human TNF-alpha. The identification of SDR sites in human 
trypsin I or derivatives thereof was done as described above. Two insertion sites 
within the scaffold were chosen for SDRs. The protease variants containing two
inserts with different sequences, and also human trypsin I itself with no
inserts, were expressed in Bacillus subtilis cells. The cells expressing the
variant proteases were separated into single-cell clones, and the
protease-expressing variants were
screened for proteolytic activity on peptides with the desired target sequence of 
TNF-alpha. The activity of the protease variants was determined as the cleavage 
rate of a peptide with the desired target sequence of TNF-alpha in the absence 
and presence of competitor substrate. The specificity is expressed as the ratio of 
cleavage rates in the presence and absence of competitor (Fig. 14). 

Table 3: Relative specificity of variants of engineered proteolytic enzymes with
different SDR sequences in absence and presence of competitor substrate (SEQ
ID NOs:84-95).

The antagonistic effect of three inventive protease variants on human TNF-alpha
is shown in Figure 15. By the use of the variants, the induction of apoptosis is
almost completely eliminated, indicating the anti-inflammatory efficacy of the
inventive proteases, which initiate TNF-alpha breakdown. TNF-alpha has been
incubated with concentrated supernatant from cultures expressing the variants i 
to iii for 2 hours. The resulting TNF-alpha has been incubated with non-modified 
cells for 4 hours. The effect of the remaining TNF-alpha activity was determined 
as the extent of apoptosis induction by detection of activated caspase-3 as 
marker for apoptotic cells. For the controls either no protease was added with 
the human TNF-alpha (dead cells) or buffer instead of human TNF-alpha (live 
cells) was used, respectively. An analogous experiment is shown in Figure 16 
using purified variant xiii. TNF-alpha was incubated with different concentrations 
of the purified inventive protease variant. 

To demonstrate the specificity of the inventive protease variants, proteins from 
human blood serum or purified human TNF-alpha have been incubated with 
human trypsin I or the inventive engineered proteolytic enzyme variants, 
respectively. Here, variant x corresponds to SEQ ID NO: 75 comprising the same
SDRs as variant f, i.e. SDRs according to SEQ ID NO: 89 (SDR1) and 95 (SDR2).
Variants xi and xii correspond to derivatives thereof comprising the same SDR
sequences. Remaining intact protein was determined as a function of time.
While the variants as well as human trypsin I digest human TNF-alpha, only 
trypsin shows activity on serum protein (Figure 17 a and b). This demonstrates 
the high TNF-alpha specificity of the inventive proteolytic enzymes and indicates 
their safety and accordingly their low side effects for therapeutic use. 

Example VI: Generation of an engineered proteolytic enzyme that specifically
hydrolyses human VEGF.

Human trypsin I was used as protein scaffold for the generation of an engineered
proteolytic enzyme with high specificity towards human VEGF. The identification
of SDR sites in human trypsin I was done as described above. Two insertion sites
within the scaffold were chosen for SDRs. The protease variants containing two
inserts with different sequences were expressed in Bacillus subtilis cells. The
cells expressing the variant proteases were separated into single-cell clones
and the protease-expressing variants were screened as described above. The
activity of the protease variants was determined as the rate of VEGF cleavage.
4 µg of recombinant human VEGF165 was incubated with 0.18 µg of purified
protease in PBS, pH 7.4, at room temperature. Aliquots were taken at the
indicated time points and analysed on a polyacrylamide gel. The extent of
cleavage was quantified by densitometric analysis of the bands. The activity is
plotted over incubation time in Figure 18. Specific cleavage was controlled by
further SDS polyacrylamide gel analyses.



WO 2004/113522 






PCT/EP2004/051173 



Claims 

1. Use of a protease with defined specificity for a target substrate for preparing a
medicament for the treatment of a specific disease related to said target
substrate.

2. The use according to claim 1, wherein

(I) the protease hydrolyzes the target substrate and thereby

(i) eliminates or reduces one or more biological activities or physico- 
chemical properties or pharmacological properties of the target 
protein, and/or 

(ii) activates or increases one or more biological activities or physico- 
chemical properties or pharmacological properties of the target 
protein, and/or 

(iii) adds one or more biological activities or physico-chemical properties
or pharmacological properties to the target protein;

and/or 

(II) the target substrate hydrolyzed by the protease is a soluble protein, in
particular a cytokine, such as the TNF-superfamily proteins, interleukins,
interferons, chemokines and growth factors; a hormone; a toxin; an enzyme,
such as oxidoreductases, transferases, hydrolases, lyases, isomerases and
ligases; a structural protein, such as collagens; or an immunoglobulin; or a
membrane associated protein, in particular a single pass transmembrane protein;
a multipass transmembrane protein, such as G-protein coupled receptors, ion
channels and transporters; a lipid-anchored membrane protein or a
GPI-anchored membrane protein.

3. The use according to claim 1 or 2, wherein the target substrate hydrolyzed by 
the protease is a TNF-superfamily protein. 

4. The use according to claim 3, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human tumor necrosis
factor-alpha (hTNF-α, SEQ ID NO:96) or related molecules of the same structural
class, preferably the peptide bonds between positions 31/32, 32/33, 44/45,
45/46, 87/88, 128/129, 130/131, 140/141 and/or 141/142, most preferably
between positions 31/32, 32/33 and/or 45/46 of hTNF-α or between analogous
positions in said related molecules; and/or

(ii) the medicament is suitable for the treatment of rheumatoid arthritis,
inflammatory bowel diseases, psoriasis, Crohn's disease, Ulcerative colitis,
diabetes type II, classical Hodgkin's Lymphoma (cHL), Grave's disease,
Hashimoto's thyroiditis, Sjogren's syndrome, systemic lupus erythematosus,
multiple sclerosis, Systemic inflammatory response syndrome (SIRS), multiple
organ dysfunction syndrome (MODS), eosinophilia, neurodegenerative disease,
stroke, closed head injury, encephalitis, CNS disorders, asthma, rheumatoid
arthritis, sepsis, vasodilation, intravascular coagulation, multiple organ failure or
other diseases connected with hTNF-α.

5. The use according to claim 4, wherein the protease is derived from a serine
protease of the structural class S1, and preferably wherein the protease is
derived from human trypsin I (SEQ ID NO:1).

6. The use according to claim 4, wherein the protease has the sequence shown
in SEQ ID NO: 74 or 75 or a derivative thereof and is capable of hydrolysing
hTNF-α at positions 31/32 and/or 32/33.

7. The use according to claim 3, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human Tumor 
necrosis factor ligand superfamily member 5 (CD40-L) or related molecules of 
the same structural class, preferably the peptide bonds between positions 
117/118, 133/134, 145/146, 165/166, 200/201, 201/202, 207/208, 216/217 
and/or 243/244, most preferred between positions 133/134, 165/166, 201/202 
and/or 216/217 of CD40-L(SEQ ID NO:143), or between analogous positions in 
related molecules; and/or 

(ii) the medicament is suitable for the treatment of systemic lupus 
erythematosus, classical Hodgkin's Lymphoma (cHL) or other diseases connected 
with CD40-L. 

8. The use according to claim 3, wherein 

(i) the engineered protease is capable of hydrolysing peptide bonds in human
Macrophage migration inhibitory factor (hMIF) or related molecules of the same
structural class, preferably the peptide bonds between positions 16/17, 44/45,
66/67, 73/74, 77/78, 88/89, 92/93 and/or 100/101, most preferred between
positions 16/17 and/or 92/93 of hMIF (SEQ ID NO: 109) or between analogous
positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of inflammatory diseases, or 
other diseases connected with hMIF. 

9. The use according to claim 1 or 2, wherein the target substrate hydrolyzed by 
the protease is an interleukin.

10. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in Interleukin-1 beta
(IL-1β) or related molecules of the same structural class, preferably the peptide
bonds between positions 24/25, 35/36, 46/47, 54/55, 74/75, 75/76, 76/77,
77/78, 86/87, 88/89, 93/94, 94/95, 97/98 and/or 150/151, most preferred
between positions 35/36, 75/76, 76/77, 88/89, 93/94, 94/95 and/or 150/151 of
IL-1β (SEQ ID NO: 112), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of diabetes, brain inflammation
in cancer, arthritis, autoimmune and inflammatory diseases or other diseases
connected with IL-1β.

11. The use according to claim 9, wherein

(i) the protease is capable of hydrolysing peptide bonds in human Interleukin 2
(hIL-2) or related molecules of the same structural class, preferably the peptide
bonds between positions 20/21, 32/33, 38/39, 43/44, 45/46, 48/49, 49/50,
54/55, 64/65, 76/77, 83/84, 84/85, 107/108, 109/110 and/or 120/121, most
preferred between positions 109/110 of hIL-2 (SEQ ID NO: 99) or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of T-cell leukemia, hairy cell
leukemia, Crohn's disease, Ulcerative colitis, Grave's disease, Hashimoto's
thyroiditis, Sjogren's syndrome, systemic lupus erythematosus, multiple
sclerosis, asthma and chronic obstructive pulmonary disease, classical Hodgkin's
Lymphoma (cHL) or other diseases connected with hIL-2.

12. The use according to claim 9, wherein

(i) the protease is capable of hydrolysing peptide bonds in human Interleukin 3
(hIL-3) or related molecules of the same structural class, preferably the peptide
bonds between positions 21/22, 28/29, 36/37, 44/45, 46/47, 51/52, 63/64,
66/67, 79/80, 94/95, 101/102, 108/109 and/or 109/110, most preferred
between positions 21/22, 28/29, 46/47, 63/64, 66/67, 79/80 and/or 101/102 of
hIL-3 (SEQ ID NO: 148), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), eosinophilia or other diseases connected with hIL-3.

13. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Interleukin 4 (hIL-4) or related molecules of the same structural 
class, preferably the peptide bonds between positions 4/5, 12/13, 
31/32, 37/38, 61/62, 62/63, 64/65, 91/92, 102/103, 121/122 
and/or 126/127, most preferred between positions 4/5, 61/62, 
62/63, 64/65 and/or 121/122 of hIL-4 (SEQ ID NO:118), or 
between analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of classical Hodgkin's 
Lymphoma (cHL), Grave's disease, Hashimoto's thyroiditis, 
Sjogren's syndrome, Asthma, chronic obstructive pulmonary 
disease, allergic inflammatory reactions or other diseases connected 
with hIL-4. 

14. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin-5 (hIL-5) or related molecules of the same structural
class, preferably the peptide bonds between positions 12/13, 32/33,
67/68, 76/77, 77/78, 80/81, 83/84, 84/85, 85/86, 90/91, 91/92,
92/93 and/or 98/99, most preferred between positions 90/91,
91/92, 92/93 and/or 98/99 of hIL-5 (SEQ ID NO:133), or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), asthma, chronic obstructive pulmonary disease,
eosinophilia, allergic inflammatory diseases or other diseases
connected with hIL-5.

15. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin-6 (hIL-6) or related molecules of the same structural
class, preferably the peptide bonds between positions 32/33, 35/36,
55/56, 71/72, 129/130, 130/131, 132/133, 135/136, 141/142,
161/162, 180/181 and/or 183/184, most preferred between
positions 135/136 and/or 141/142 of hIL-6 (SEQ ID NO: 134), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), breast cancer, renal cell carcinoma, multiple
myeloma, lymphoma, leukemia, Grave's disease, Hashimoto's
thyroiditis, Sjogren's syndrome, systemic lupus erythematosus,
Systemic inflammatory response syndrome (SIRS) which leads to
distant organ damage and multiple organ dysfunction syndrome
(MODS), chronic obstructive pulmonary disease (COPD),
Castleman's diseases, inflammatory bowel diseases, Crohn's disease
or other diseases connected with hIL-6.

16. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 8 (hIL-8) or related molecules of the same structural
class, preferably the peptide bonds between positions 11/12, 15/16,
45/46, 47/48, 52/53, 54/55, 60/61, 64/65 and/or 67/68, most
preferred between positions 45/46 of hIL-8 (SEQ ID NO:100), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's disease,
Ulcerative colitis, classical Hodgkin's Lymphoma (cHL), Systemic
inflammatory response syndrome (SIRS) which leads to distant
organ damage and multiple organ dysfunction syndrome (MODS),
chronic obstructive pulmonary disease (COPD), endometriosis,
psoriasis, atherosclerotic lesions or other diseases connected with
hIL-8.

17. The use according to claim 9, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin-10 (hIL-10) or related molecules of the same structural
class, preferably the peptide bonds between positions 24/25, 25/26,
27/28, 28/29, 40/41, 44/45, 49/50, 57/58, 59/60, 84/85, 86/87,
106/107, 107/108, 110/111, 130/131, 134/135, 137/138, 138/139
and/or 144/145, most preferred between positions 24/25, 27/28,
44/45, 49/50, 86/87, 137/138 and/or 144/145 of hIL-10 (SEQ ID
NO: 135), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), diseases related to the suppression of cytotoxic
T-cells or other diseases connected with hIL-10.



18. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 12 beta chain (hIL-12β) or related molecules of the
same structural class, preferably the peptide bonds between
positions 14/15, 18/19, 29/30, 34/35, 87/88, 99/100, 102/103,
104/105, 161/162, 174/175, 222/223, 225/226, 228/229, 238/239,
268/269 and/or 293/294, most preferred between positions 18/19,
34/35, 87/88 and/or 161/162 of hIL-12β (SEQ ID NO:97) or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's disease,
classical Hodgkin's Lymphoma (cHL) or other diseases connected
with hIL-12β.



19. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 13 (hIL-13) or related molecules of the same structural
class, preferably the peptide bonds between positions 25/26, 62/63,
65/66, 86/87, 87/88, 98/99, 108/109 and/or 111/112, most
preferred between positions 87/88 of hIL-13 (SEQ ID NO: 119), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of cancer, classical
Hodgkin's Lymphoma (cHL), eosinophilia, asthma, chronic
obstructive pulmonary disease, fibrosis, psoriasis, atopic dermatitis,
Ulcerative colitis or other diseases connected with hIL-13.

20. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 18 (hIL-18) or related molecules of the same structural
class, preferably the peptide bonds between positions 17/18, 32/33,
37/38, 39/40, 40/41, 53/54, 58/59, 79/80, 90/91, 93/94, 98/99,
110/111, 120/121, 123/124, 131/132, 132/133, 142/143, 147/148
and/or 157/158, most preferred between positions 37/38, 132/133,
142/143 and/or 157/158 of hIL-18 (SEQ ID NO:98) or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's disease,
inflammatory liver injuries, pulmonary tuberculosis, pleural
tuberculosis, rheumatoid arthritis or other diseases connected with
hIL-18.

21. The use according to claim 1 or 2, wherein the target substrate hydrolyzed
by the protease is an interferon.

22. The use according to claim 21, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Interferon-gamma (hIFN-γ) or related molecules of the same
structural class, preferably the peptide bonds between positions
2/3, 6/7, 13/14, 21/22, 24/25, 34/35, 36/37, 37/38, 62/63, 68/69,
83/84, 86/87, 90/91, 102/103, 107/108 and/or 108/109, most
preferred between positions 13/14, 24/25, 37/38, 62/63, 68/69,
102/103 and/or 107/108 of hIFN-γ (SEQ ID NO: 137), or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), Crohn's disease, type I diabetes or other diseases
connected with IFN-γ.

23. The use according to claim 1 or 2, wherein the target substrate hydrolyzed
by the protease is a chemokine. 

24. The use according to claim 23, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human small 
inducible cytokine A2 (hCCL2) or related molecules of the same 
structural class, preferably the peptide bonds between positions 
3/4, 13/14, 18/19, 19/20, 24/25, 29/30, 38/39, 54/55, 56/57, 
58/59, 62/63, 65/66 and/or 68/69, most preferred between 
positions 19/20, 29/30, 38/39, 54/55 and/or 62/63 of hCCL2 (SEQ 
ID NO: 102), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of Crohn's disease, 
Ulcerative colitis, or other diseases connected with hCCL2. 

25. The use according to claim 23, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Eotaxin (hCCL11) or related molecules of the same structural class,
preferably the peptide bonds between positions 11/12, 16/17,
17/18, 22/23, 27/28, 33/34, 44/45, 47/48, 48/49, 52/53, 54/55,
56/57, 60/61, 66/67, and/or 73/74, most preferred between
positions 48/49 and/or 66/67 of hCCL11 (SEQ ID NO: 101), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's
disease and Ulcerative colitis, classical Hodgkin's Lymphoma (cHL),
chronic pathophysiologic dysfunction, characterized by an influx
mainly of Th2 cells, eosinophilia or other diseases connected with
hCCL11.

26. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a growth factor. 

27. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Vascular endothelial growth factor (hVEGF) or related molecules of
the same structural class, preferably the peptide bonds between
positions 16/17, 19/20, 23/24, 34/35, 41/42, 56/57, 62/63, 63/64,
64/65, 65/66, 82/83, and/or 84/85, most preferred between
positions 23/24, 41/42, 63/64, 82/83 and/or 84/85 of hVEGF (SEQ
ID NO: 103), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of all solid tumors and
metastatic solid tumors, inflammatory breast cancer or other
diseases connected with hVEGF.



28. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Transforming growth factor beta 1 (hTGF-β1) or related molecules
of the same structural class, preferably the peptide bonds between
positions 23/24, 25/26, 26/27, 27/28, 37/38, 55/56 and/or 94/95,
most preferred between positions 25/26, 55/56 and/or 94/95 of
hTGF-β1 (SEQ ID NO: 104), or between analogous positions in
related molecules; and/or

(ii) the medicament is suitable for the treatment of a variety of cancers
including breast cancer, colorectal cancer and classical Hodgkin's
Lymphoma (cHL); fibrosis, suppression of cell-mediated immunity,
glaucoma, diffuse systemic sclerosis or other diseases connected
with hTGF-β1.



29. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Somatotropin (hGrowth hormone; hGH) or related molecules of the
same structural class, preferably the peptide bonds between
positions 8/9, 16/17, 19/20, 26/27, 33/34, 38/39, 41/42, 70/71,
77/78, 94/95, 103/104, 112/113, 115/116, 116/117, 130/131,
147/148, 154/155 and/or 178/179, most preferred between
positions 112/113, 147/148 and/or 154/155 of hGH (SEQ ID
NO: 121), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of acromegaly,
diabetes and diabetic kidney disease including renal hypertrophy
and glomerular enlargement, cardiovascular disorders or other
diseases connected with hGH.

30. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in Insulin-like
growth factor II (hIGF-II) or related molecules of the same
structural class, preferably the peptide bonds between positions
15/16, 23/24, 24/25, 34/35, 37/38, 38/39, 48/49 and/or 49/50,
most preferred between positions 23/24 of hIGF-II (SEQ ID
NO: 122), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of diabetes, diabetic
kidney disease or other diseases connected with hIGF-II.

31. The use according to claim 26, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Hepatocyte growth factor (hHGF) or related molecules of the same
structural class, preferably the peptide bonds between positions
54/55, 60/61, 62/63, 63/64, 68/69, 76/77, 112/113, 123/124,
134/135, 168/169, 198/199 and/or 202/203, most preferred
between positions 63/64, 68/69, 76/77, 168/169 and/or 202/203 of
hHGF (SEQ ID NO: 120), or between analogous positions in related
molecules; and/or

(ii) the medicament is suitable for the treatment of angiogenic 
disorders, hepatocellular carcinoma or other diseases connected 
with hHGF. 

32. The use according to claim 1 or 2, wherein the target substrate hydrolyzed
by the protease is a soluble hormone.

33. The use according to claim 32, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Insulin (hInsulin) or related molecules of the same structural class,
preferably the peptide bonds between positions 16/17 and/or 22/23
of hInsulin B chain (SEQ ID NO: 105), and/or between positions
14/15 of Insulin A chain (SEQ ID NO: 106) or between analogous
positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of insulin overdosage
or other diseases connected with hInsulin.

34. The use according to claim 32, wherein 

(i) the engineered protease is capable of hydrolysing peptide bonds in 
human Ghrelin (Ghrelin) or related molecules of the same structural 
class, preferably the peptide bonds between positions 1/2, 2/3, 3/4 
and/or 4/5 of hGhrelin (SEQ ID NO: 107), or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of obesity or other 
diseases connected with hGhrelin. 

35. The use according to claim 32, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
angiotensinogen (angiotensin) or related molecules of the same
structural class, preferably the peptide bonds between positions
1/2, 3/4, and/or 7/8, most preferred between positions 3/4 of
angiotensin (SEQ ID NO: 108), or between analogous positions in
related molecules; and/or

(ii) the medicament is suitable for the treatment of essential
hypertension and other diseases connected with angiotensin.

36. The use according to claim 32, wherein 

(i) the engineered protease is capable of hydrolysing peptide bonds in
human leptin (leptin) or related molecules of the same structural
class, preferably the peptide bonds between positions 8/9, 9/10,
15/16, 23/24, 40/41, 53/54, 71/72, 85/86, 94/95, 108/109 and/or
141/142, most preferred between positions 9/10, 40/41, 71/72,
94/95 and/or 108/109 of leptin (SEQ ID NO:127), or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of obesity or other
diseases connected with leptin.

37. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a binary toxin. 

38. The use according to claim 37, wherein 

(i) the engineered protease is capable of hydrolysing peptide bonds in
Protective antigen precursor (PA-83) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 72/73, 73/74, 92/93, 93/94, 131/132, 149/150, 178/179, 
213/214, 214/215, 387/388, 425/426, 426/427, 427/428, 453/454, 
520/521, 608/609, 617/618, 671/672, 679/680, 680/681, 683/684 
and/or 684/685, most preferred between positions 72/73, 73/74, 
93/94, 149/150, 387/388, 425/426, 427/428 and/or 683/684 of 
hPA-83 (SEQ ID NO: 123), or between analogous positions in related
molecules; and/or 

(ii) the medicament is suitable for the treatment of anthrax infection or 
other diseases connected with PA-83.

39. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a protease, in particular a serine, cysteine or metalloprotease.

40. The use according to claim 39, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
plasminogen (plasminogen) or related molecules of the same
structural class, preferably the peptide bonds between positions
580/581 of plasminogen (SEQ ID NO: 140), or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of thrombosis or other 
diseases connected with plasminogen. 

41. The use according to claim 39, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Prothrombin (thrombin) or related molecules of the same structural
class, preferably the peptide bonds between positions 198/199,
327/328 and/or 363/364, most preferred between positions 327/328
and/or 363/364 of thrombin (SEQ ID NO: 149), or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of bleeding or other
diseases connected with thrombin.

42. The use according to claim 39, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human beta- 
secretase or related molecules of the same structural class, 
preferably the peptide bonds between positions 61/62, 64/65, 
130/131, 131/132, 159/160, 216/217, 238/239, 239/240, 246/247, 
256/257, 259/260, 330/331, 365/366, 378/379 and/or 381/382, 
most preferred between positions 61/62, 131/132, 246/247, 
259/260, 365/366 and/or 378/379 of beta-secretase (SEQ ID 
NO: 139), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of Alzheimer's disease or
other diseases connected with beta-secretase precursor.

43. The use according to claim 39, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
matrix metalloproteinase-2 (hMMP-2) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 62/63, 68/69, 75/76, 76/77, 79/80, 88/89, 110/111, 
112/113, 115/116, 120/121, 164/165, 254/255, 267/268, 296/297, 
324/325, 325/326, 382/383, 383/384, 470/471, 500/501, 550/551, 
564/565, 595/596, 597/598, 608/609, 646/647, 649/650 and/or 
650/651, most preferred between positions 68/69, 115/116,
120/121, 164/165, 325/326, 383/384, 470/471, 500/501, 595/596, 
608/609 and/or 650/651 of hMMP-2 (SEQ ID NO: 131), or between 
analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of a variety of cancers 
including bladder cancer, breast tumor cancer, gastric cancer, lung 
cancer or other diseases connected with hMMP-2.

44. The use according to claim 39, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
matrix metalloproteinase-9 (hMMP-9) or related molecules of the
same structural class, preferably the peptide bonds between 
positions 41/42, 42/43, 106/107, 113/114, 134/135, 160/161, 
162/163, 163/164, 222/223, 226/227, 265/266, 266/267, 267/268, 
284/285, 309/310, 321/322, 322/323, 324/325, 356/357, 380/381, 
433/434 and/or 440/441, most preferred between positions 
160/161, 163/164, 226/227, 284/285, 321/322, 322/323 and/or 
433/434 of hMMP-9 (SEQ ID NO: 132), or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of a variety of cancers
including bladder cancer, breast tumor cancer, gastric cancer, lung
cancer or other diseases connected with hMMP-9.

45. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a type-1 single pass transmembrane protein. 

46. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in HIV 
membrane glycoprotein (GP120) or related molecules of the same 
structural class, preferably the peptide bonds between positions 
97/98, 99/100, 107/108, 113/114, 117/118, 227/228, 231/233, 
279/280, 335/336, 337/338, 368/369, 412/413, 419/420, 429/430, 
444/445, 457/458, 474/475, 476/477, 477/478, 485/486 and/or 
490/491, most preferred between positions 99/100, 368/369, 
412/413, 419/420, 444/445 and/or 490/491 of GP120 (SEQ ID 
NO: 124) or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of AIDS or HIV 
infection or other diseases connected with GP120. 

47. The use according to claim 45, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Cytotoxic T-lymphocyte protein 4 (CTLA-4) or related molecules of 
the same structural class, preferably the peptide bonds between 
positions 14/15, 28/29, 33/34, 38/39, 41/42, 62/63, 72/73, 85/86, 
95/96, 100/101, 105/106, 119/120, 125/126 and/or 127/128, most 
preferred between positions 14/15, 28/29, 38/39, 41/42, 62/63 
and/or 85/86 of CTLA-4 (SEQ ID NO: 144), or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of breast cancer or 
other diseases connected with CTLA-4. 



48. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Integrin alpha-2 (hVLA-2) or related molecules of the same 
structural class, preferably the peptide bonds between positions 
160/161, 174/175, 201/202, 219/220, 231/232, 232/233, 233/234, 
243/244, 259/260, 264/265, 268/269, 288/289, 292/293, 294/295, 
298/299, 301/302, 310/311 and/or 317/318, most preferred
between positions 160/161, 174/175, 201/202, 219/220, 243/244, 
264/265, 292/293 and/or 294/295 of hVLA-2 (SEQ ID NO: 147), or 
between analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of renal tumors, uveal 
melanomas, gastrointestinal tumors or other diseases connected 
with hVLA-2. 



49. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Vascular endothelial growth factor receptor 1 (hVEGFR 1) or related
molecules of the same structural class, preferably the peptide bonds 
between positions 175/176, 180/181, 187/188, 189/190, 190/191, 
224/225 and/or 331/332, most preferred between positions 
189/190 and/or 331/332 of hVEGFR 1 (SEQ ID NO:114), or 
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of solid tumors and
metastatic solid tumors, astrocytic brain tumors, pancreatic cancer,
metastatic renal cancer or other diseases connected with hVEGFR 1.

50. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Vascular endothelial growth factor receptor 2 (hVEGFR 2) or related
molecules of the same structural class, preferably the peptide bonds
between positions 214/215 and/or 323/324, most preferred
between positions 214/215 of hVEGFR 2 (SEQ ID NO:115), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of solid tumors and
metastatic solid tumors, pancreatic cancer, metastatic renal cancer,
metastatic CRC, or other diseases connected with hVEGFR 2.

51. The use according to claim 45, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Epidermal growth factor receptor (hEGFr) or related molecules of
the same structural class, preferably the peptide bonds between
positions 20/21, 29/30, 48/49, 74/75, 155/156, 165/166, 167/168,
202/203, 206/207, 220/221, 223/224, 246/247, 251/252, 254/255, 
269/270, 270/271, 297/298, 304/305, 305/306, 357/358, 364/365, 
369/370, 430/431, 443/444, 454/455, 455/456, 463/464, 465/466, 
476/477, 507/508 and/or 509/510, most preferred between 
positions 155/156, 246/247, 251/252, 254/255, 269/270, 270/271, 
297/298, 304/305, 306/307, 364/365 and/or 454/455 of hEGFr 
(SEQ ID NO: 116), or between analogous positions in related
molecules; and/or 

(ii) the medicament is suitable for the treatment of bladder cancer,
breast cancer, cervical cancer, colorectal cancer, endometrial
cancer, oesophageal cancer, head and neck cancer, gastric cancer,
non-small-cell lung carcinoma, ovarian cancer or other diseases
connected with hEGFr.

52. The use according to claim 45, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Epithelial cell adhesion molecule (Ep-CAM) or related molecules of
the same structural class, preferably the peptide bonds between 
positions 14/15, 19/20, 25/26, 29/30, 30/31, 33/34, 44/45, 55/56, 
67/68, 70/71, 90/91 and/or 100/101, most preferred between 
positions 14/15, 30/31, 44/45, 70/71 and/or 100/101 of Ep-CAM 
(SEQ ID NO: 125) or between analogous positions in related 
molecules; and/or 

(ii) the medicament is suitable for the treatment of colorectal cancer or 
other diseases connected with Ep-CAM. 

53. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Insulin-like growth factor I receptor (hIGF-1r) or related molecules
of the same structural class, preferably the peptide bonds between
positions 39/40, 59/60, 115/116, 132/133, 146/147, 171/172,
191/192, 250/251, 262/263, 270/271, 290/291, 306/307, 307/308,
335/336, 336/337, 403/404, 405/406, 455/456 and/or 470/471,
most preferred between positions 39/40, 262/263, 306/307,
307/308, 335/336, 405/406 and/or 470/471 of hIGF-1r (SEQ ID
NO: 126) or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of a variety of cancers
including breast cancer or other diseases connected with hIGF-1r.

54. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
T-cell surface antigen CD2 (hCD2) or related molecules of the same
structural class, preferably the peptide bonds between positions 
16/17, 20/21, 28/29, 29/30, 40/41, 42/43, 43/44, 48/49, 49/50, 
51/52, 54/55, 58/59, 63/64, 69/70, 76/77, 89/90 and/or 91/92, 
most preferred between positions 28/29, 40/41, 43/44, 51/52, 
76/77 and/or 89/90 of hCD2 (SEQ ID NO: 128) or between 
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of psoriasis or other
diseases connected with hCD2.

55. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
T-cell surface glycoprotein CD4 (hCD4) or related molecules of the
same structural class, preferably the peptide bonds between 
positions 53/54, 63/64, 88/89, 166/167, 167/168, 173/174, 
206/207, 219/220, 224/225, 226/227, 230/231, 244/245, 251/252, 
252/253, 322/323, 329/330 and/or 334/335, most preferred 
between positions 88/89, 173/174, 206/207, 219/220, 230/231, 
251/252 and/or 252/253 of hCD4 (SEQ ID NO: 129), or between 
analogous positions in related molecules; 

(ii) preferably the medicament is suitable for the treatment of psoriasis, 
transplant rejection, graft-versus-host colitis, autoimmune 
disorders, rheumatoid arthritis or other diseases connected with 
hCD4. 

56. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Integrin alpha-L (hCD11a) or related molecules of the same
structural class, preferably the peptide bonds between positions 
145/146, 152/153, 156/157, 159/160, 160/161, 177/178, 178/179, 
189/190, 190/191, 191/192, 193/194, 197/198, 200/201, 221/222, 
229/230, 249/250, 253/254, 268/269, 290/291, 297/298, 304/305 
and/or 305/306, most preferred between positions 145/146, 
159/160, 160/161, 189/190, 229/230, 249/250, 268/269, 297/298, 
304/305 and/or 305/306 of hCD11a (SEQ ID NO: 130), or between 
analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of psoriasis or other
diseases connected with hCD11a.



57. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interferon-gamma receptor alpha chain (hIFN-γ-R1) or related
molecules of the same structural class, preferably the peptide bonds
between positions 49/50, 52/53, 58/59, 62/63, 72/73, 76/77,
106/107, 107/108, 116/117, 122/123, 174/175, 176/177, 179/180,
215/216 and/or 222/223, most preferred between positions 49/50,
72/73, 116/117, 122/123, 174/175, 176/177 and/or 215/216 of
hIFN-γ-R1 (SEQ ID NO:136), or between analogous positions in
related molecules; and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), type I diabetes or other diseases connected with
hIFN-γ-R1.



58. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Platelet membrane glycoprotein IIb/IIIa (GPIIb/IIIa) or related
molecules of the same structural class, preferably the peptide bonds 
between positions 67/68, 76/77, 91/92, 129/130, 143/144, 
144/145, 179/180, 181/182, 198/199, 208/209, 209/210, 216/217, 
239/240, 261/262, 410/411, 434/435, 532/533, 556/557, 557/558, 
596/597, 597/598, 621/622, 650/651, 651/652, 661/662, 662/663 
and/or 689/690, most preferred between positions 67/68, 76/77, 
179/180, 261/262, 410/411, 434/435, 650/651, 662/663 and/or 
689/690 of GPIIb/IIIa (SEQ ID NO:141), or between analogous
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of unstable angina, 
carotid stenting, ischemic stroke, peripheral vascular diseases, 
angiogenesis-related diseases, disseminating tumors or other 
diseases connected with GPIIb/IIIa.



59. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Intercellular adhesion molecule-1 (ICAM-1) or related molecules of
the same structural class, preferably the peptide bonds between
positions 26/27, 40/41, 60/61, 71/72, 88/89, 97/98, 102/103,
128/129, 131/132, 132/133, 149/150, 150/151, 151/152, 160/161,
164/165 and/or 166/167, most preferred between positions 71/72,
88/89, 102/103, 150/151, 151/152, 160/161 and/or 166/167 of
ICAM-1 (SEQ ID NO: 142), or between analogous positions in related
molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's disease or
other diseases connected with ICAM-1.

60. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
TGF-beta receptor type II (hTGF-β RII) or related molecules of the same
structural class, preferably the peptide bonds between positions
32/33, 34/35, 35/36, 66/67, 67/68, 69/70, 82/83, 103/104,
104/105, 105/106, 118/119, 122/123 and/or 130/131, most
preferred between positions 32/33, 34/35, 66/67, 69/70, 104/105,
122/123 and/or 130/131 of hTGF-β RII (SEQ ID NO:145), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of diffuse systemic
sclerosis or other diseases connected with hTGF-β RII.

61. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Membrane cofactor protein (hMCP) or related molecules of the same 
structural class, preferably the peptide bonds between positions 
15/16, 17/18, 25/26, 27/28, 31/32, 32/33, 35/36, 47/48, 48/49, 
58/59, 67/68, 69/70, 70/71, 110/111, 119/120, 125/126 and/or 
130/131, most preferred between positions 15/16, 32/33, 47/48, 
48/49, 70/71, 119/120 and/or 125/126 of hMCP (SEQ ID NO:146),
or between analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of renal tumors, uveal 
melanomas, gastrointestinal tumors or other diseases connected 
with hMCP. 

62. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a protease-activated receptor or hydroxytryptamine receptor. 

63. The use according to claim 62, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Protease activated receptor 1 (hPAR1) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 46/47, 50/51, 51/52, 52/53 and/or 58/59 of hPAR1 (SEQ 
ID NO: 110), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of thrombosis or other 
diseases connected with hPAR1.

64. The use according to claim 62, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Protease activated receptor 2 (hPAR2) or related molecules of the
same structural class, preferably the peptide bonds between
positions 41/42, 44/45, 51/52, 59/60 and/or 62/63 of hPAR2 (SEQ
ID NO:111), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of Crohn's disease,
ulcerative colitis and inflammatory bowel disease, asthma,
inflammation associated pain and arthritis or other diseases
connected with hPAR2.

65. The use according to claim 62, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Protease activated receptor 4 (hPAR4) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 57/58, 59/60, 68/69, 74/75 and/or 78/79 of hPAR4 (SEQ 
ID NO: 113), or between analogous positions in related molecules;
and/or 

(ii) the medicament is suitable for the treatment of thrombosis or other 
diseases connected with hPAR4. 

66. The use according to claim 62, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
5-hydroxytryptamine 1A receptor (h5-HT-1A) or related molecules of
the same structural class, preferably the peptide bonds between
positions 101/102, 102/103, 181/182 and/or 370/371 of h5-HT-1A
(SEQ ID NO: 117) or between analogous positions in related
molecules; and/or

(ii) the medicament is suitable for the treatment of irritable bowel
syndrome or other diseases connected with h5-HT-1A.

67. The use according to claim 1 or 2, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
carcinoembryonic antigen (hCEA) or related molecules of the same 
structural class, preferably the peptide bonds between positions 
17/18, 69/70, 71/72, 74/75, 77/78, 98/99, 116/117, 126/127
and/or 128/129 of hCEA (SEQ ID NO: 138), or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of colon cancer or 
other diseases connected with hCEA. 

68. The use according to any one of claims 1 to 67, wherein the protease is an 
engineered protease, preferably an engineered protease characterized by a 
combination of the following components: 

(a) a protein scaffold capable of catalyzing at least one chemical reaction on at
least one target substrate and being derived from one or more proteins, and 

(b) one or more specificity determining regions (SDRs) located at sites in the 
protein scaffold that enable the resulting engineered protein to discriminate 
between at least one target substrate and one or more different substrates, and 
wherein the SDRs are essentially synthetic peptide sequences. 

69. The use according to claim 68, wherein 

(I) the SDRs (b) have a length between one and 50 amino acid residues, 
preferably have a length between 2 and 20 amino acid residues, more preferably 
a length between 2 and 10 amino acid residues, even more preferably a length 
between 3 and 8 amino acid residues, and wherein the number of SDRs is at 
least one, preferably more than one, more preferably between two and eleven, 
most preferably between two and six; and/or 

(II) the protein scaffold (a) is comprised of one or more polypeptide segments
being derived from the same or different

(i) proteins encoded by a gene of viral or prokaryotic or eukaryotic origin, and/or

(ii) native enzymes, mutated variants or truncated derivatives thereof, and/or

(iii) mammalian enzymes, preferably human enzymes. 

70. The use according to claim 68 or 69, wherein the protein scaffold (a) is
derived from a protease selected from the group consisting of aspartic, cysteine,
serine, metallo and threonine proteases,

even more preferably the protein scaffold (a) is derived from a serine protease of
the structural class S1, S8, S11, S21, S26, S33 or S51, most preferably from
class S1 or S8, a cysteine protease of the structural class C1, C2, C4, C10, C14,
C19, C47, C48 or C56, most preferably from class C14, or an aspartic protease of
the structural class A1, A2 or A26, most preferably from class A1, or a
metalloprotease of the structural class M4 or M10.

71. The use according to any one of claims 68 to 70, wherein

(i) the protein scaffold (a) is derived from a serine protease of the
structural class S1; and/or

(ii) the SDRs are located at one or more positions selected from the 
group of positions that correspond structurally or by amino acid 
sequence homology to the regions 18-25, 38-48, 54-63, 73-86, 
122-130, 148-156, 165-171 and 194-204 in human trypsin I having 
the amino acid sequence shown in SEQ ID NO:1, and preferably at 
one or more positions selected from the group of positions that 
correspond structurally or by amino acid sequence homology to the 
regions 20-23, 41-45, 57-60, 76-83, 125-128, 150-153, 167-169 
and 197-201 in human trypsin I. 

72. The use according to claim 71, wherein 

(i) the protein scaffold (a) is derived from the serine protease trypsin,
preferably human trypsin I having the amino acid sequence shown
in SEQ ID NO:1, or the amino acid sequence SEQ ID NO:1
comprising one or more of the following amino acid substitutions
E56G, R78W, Y131F, A146T and C183R; and

(ii) at least one of two SDRs is located in the scaffold, a first SDR
having a length of up to 6 amino acids and being inserted between
residues 42 and 43, and a second SDR having a length of up to 5
amino acids and being inserted between residues 123 and 124
(numbering relative to human trypsin having the amino acid
sequence shown in SEQ ID NO:1).
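
The insertion geometry recited above can be sketched as follows, purely as an illustration. The function name `insert_sdrs`, the `SDR_SITES` table and the poly-glycine placeholder scaffold are hypothetical; the actual scaffold would be the sequence of SEQ ID NO:1, which is not reproduced here. The sketch assumes 1-based residue numbering relative to the unmodified scaffold, so the C-terminal insertion is applied first to keep the N-terminal numbering valid.

```python
# Insertion sites: name -> (residue after which the SDR is inserted, maximum SDR length).
# The values follow the claim wording; the table itself is a hypothetical helper.
SDR_SITES = {"first": (42, 6), "second": (123, 5)}


def insert_sdrs(scaffold: str, first_sdr: str = "", second_sdr: str = "") -> str:
    """Insert up to two SDR peptides into a scaffold given in one-letter code."""
    peptides = {"first": first_sdr, "second": second_sdr}
    result = scaffold
    # Work from the C-terminal site towards the N-terminus so that scaffold
    # numbering is not shifted by an earlier insertion.
    for name in sorted(SDR_SITES, key=lambda n: SDR_SITES[n][0], reverse=True):
        after_residue, max_len = SDR_SITES[name]
        peptide = peptides[name]
        if len(peptide) > max_len:
            raise ValueError(f"{name} SDR is longer than {max_len} residues")
        if peptide and after_residue >= len(scaffold):
            raise ValueError(f"scaffold too short for the {name} SDR site")
        result = result[:after_residue] + peptide + result[after_residue:]
    return result


if __name__ == "__main__":
    placeholder = "G" * 130  # stand-in only, not a real trypsin sequence
    engineered = insert_sdrs(placeholder, first_sdr="AAA", second_sdr="WW")
    print(len(engineered), engineered[42:45], engineered[126:128])
```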

73. The use according to claim 72, wherein 

(i) one of the peptide sequences of the following group: SEQ ID NO: 
72, 78, 79, 80, 84, 85, 86, 87, 88, and 89 is inserted as the first 
SDR between residues 42 and 43; and/or one of the peptide 
sequences of the following group: SEQ ID NO: 73, 81, 82, 83, 90, 
91, 92, 93, 94, and 95 is inserted as the second SDR between 
residues 123 and 124; or 

(ii) the engineered enzyme comprises an amino acid sequence as 
shown in SEQ ID NO: 74, or SEQ ID NO: 75. 

74. The use according to any one of claims 68 to 70, wherein 

(i) the protein scaffold (a) is derived from a serine protease of the 
structural class S8; and/or 

(ii) the SDRs are located at one or more positions selected from the
group of positions that correspond structurally or by amino acid
sequence homology to the regions 6-17, 25-29, 47-55, 59-69, 101-111,
117-125, 129-137, 139-154, 158-169, 185-195 and 204-225
in subtilisin E from Bacillus subtilis having the amino acid sequence shown in
SEQ ID NO: 7, and preferably at one or more positions selected from
the group of positions that correspond structurally or by amino acid
sequence homology to the regions 59-69, 101-111, 129-137, 158-169
and 204-225 in subtilisin E from Bacillus subtilis.

75. The use according to any one of claims 68 to 70, wherein 

(i) the protein scaffold (a) is derived from an aspartic protease of the 
structural class A1; and/or

(ii) the SDRs are located at one or more positions selected from the 
group of positions that correspond structurally or by amino acid 
sequence homology to the regions 6-18, 49-55, 74-83, 91-97, 112-120,
126-137, 159-164, 184-194, 242-247, 262-267 and 277-300
in human pepsin having the amino acid sequence shown in SEQ ID
NO:11, and more preferably at one or more positions selected from 
the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 10-15, 75-80, 114-118, 130- 
134, 186-191 and 280-296 in human pepsin. 

76. The use according to any one of claims 68 to 70, wherein 

(i) the protein scaffold (a) is derived from a cysteine protease of the 
structural class C14; and/or 

(ii) the SDRs are located at one or more positions selected from the 
group of positions that correspond structurally or by amino acid 
sequence homology to the regions 78-91, 144-160, 186-198, 226- 
243 and 271-291 in human caspase 7 having the amino acid 
sequence of SEQ ID NO: 14, and preferably at one or more positions 
selected from the group of positions that correspond structurally or 
by amino acid sequence homology to the regions 80-86, 149-157, 
190-194 and 233-238 of human caspase 7.

77. The use according to any one of claims 1 to 76, wherein the protease 
comprises 

(i) at least one further proteinaceous component, preferably being selected from
the group consisting of binding domains, receptors, antibodies, regulation 
domains, pro-sequences, and fragments thereof, and/or 

(ii) at least one further functional component, preferably being selected from the 
group consisting of polyethylene glycols, carbohydrates, lipids, fatty acids, nucleic
acids, metals, metal chelates, and fragments or derivatives thereof. 

78. The use according to any one of claims 1 to 77, wherein the protease is 
obtainable by a method comprising at least the following steps: 

(a) providing a protein scaffold which catalyzes at least one chemical reaction on 
at least one target substrate, 

(b) generating a library of enzymes or isolated enzymes by combining the 
protein scaffold from step (a) with variants of one or more fully or partially 
random peptide sequences at sites in the protein scaffold that enable the
resulting enzyme to discriminate between at least one target substrate and one
or more different substrates, and 

(c) selecting out of the (library of) enzymes generated in step (b) one or more 
enzymes that have defined specificities towards at least one target substrate. 
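
Steps (b) and (c) above can be pictured with the following minimal sketch, which is illustrative only and not the screening procedure of the disclosure. The names `generate_library` and `select_specific`, the single insertion site, the `activity` callback and the `min_ratio` threshold are all assumptions introduced for the example; in practice `activity` would stand for a measured assay value rather than a computed one.

```python
import random
from typing import Callable, List, Sequence

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter code


def generate_library(scaffold: str, site: int, sdr_length: int,
                     size: int, seed: int = 0) -> List[str]:
    """Step (b): combine a scaffold with fully random peptides at one site.

    `site` is the number of scaffold residues preceding the insertion.
    """
    rng = random.Random(seed)
    library = []
    for _ in range(size):
        sdr = "".join(rng.choice(AMINO_ACIDS) for _ in range(sdr_length))
        library.append(scaffold[:site] + sdr + scaffold[site:])
    return library


def select_specific(library: Sequence[str],
                    activity: Callable[[str, str], float],
                    target: str,
                    off_targets: Sequence[str],
                    min_ratio: float) -> List[str]:
    """Step (c): keep variants whose target activity sufficiently exceeds
    their highest activity on any other substrate."""
    selected = []
    for enzyme in library:
        on_target = activity(enzyme, target)
        off_target = max((activity(enzyme, s) for s in off_targets), default=0.0)
        if on_target > 0 and (off_target == 0 or on_target / off_target >= min_ratio):
            selected.append(enzyme)
    return selected
```

A ratio of on-target to off-target activity is only one possible way to express a "defined specificity"; other selection criteria fit the same two-step structure.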

79. Use of an enzyme according to any one of claims 1 to 78 for in vivo or in 
vitro diagnostic purposes. 

80. A pharmaceutical or diagnostic composition comprising one or more enzymes 
according to any one of claims 1 to 78, said pharmaceutical or diagnostic 
composition optionally comprising pharmaceutically or diagnostically acceptable
carrier(s), excipient(s) and/or auxiliary agent(s). 

81. A method for cleaving a target substrate as defined in claims 1 to 67 in vivo 
or in vitro, which comprises contacting the target substrate with a protease as 
defined in claims 1 to 78. 

82. A method for treatment of a disease in a patient connected with a specific
target substrate as defined in any one of claims 1 to 67, which comprises
administering to the patient a suitable amount of a protease with defined specificity
for said specific target substrate as defined in any one of claims 1 to 78.

Trypsin IVGGYNCEENSVPYQVSL NSGYHF-CGGSLINEQWWSAGHCY

a-Thrombin IVEGSDAEIGMSPWQVMIjFRKSPQELL-CGASLISDRWVLTAAHCLLYPP 

Enteropeptidase IVGGSNAKEGAWPWWGL YYGGRLLCGASLVSSDWLVSAAHCVYGRN 

** * * * * ** ** * * ** 



Trypsin KSRIQVRLGEH NIEVLEGN-EQFINAAKIIRHPQYD-RKTL 

a-Thrombin WDKNFTENDLLVRIGKH SR TR Y ERN I E K I SMLEK I Y I HP R YNWRENL 

Enteropeptidase LE PSKWTAILGLHMKSNLTSPQTV— PRLID — EIVINPHYN-RRRK 

-1 * * * * * * 



Trypsin NNDIMLIKLSS RAVINARVS TI SLPTA PPAT — GTKCLISGWG 

a-Thrombin DRD I ALMKLKKP VAF S D Y I HP VCLPDR ETAASLLQAGYKGRVTGWG 

Enteropeptidase DND I AMMHLEFKVNYTD YI QP I CLPEENQVFPP - GRNCSIAGWG 

** * ** _, 2 * *** 



Trypsin N— TASSGADYPDELQCLDAPVLSQAKCEASYPG-KITSNMFCVGFL 

a-Thrombin NLKETWTANVGKGQPSVLQWNLPIVERPVCKDSTRI-RITDNMFCAGYK 

Enteropeptidase T WYQGTT— ANI LQEAD VPLLSNERCQQQMPE YNI TENMI CAGYE 

— 3 — * ** * * ****** 



Trypsin — EGGK — DSCQGD S GGPVVCNGQ L Q GVVS WGDGCAQKNKP 

a-Thrombin PDEGKRGDACEGDSGGPFVMKSP F NNRW Y QMG IVS WGE G CDRD GK Y 

Enteropeptidase -EGGI — DSCQGD SGGPLMCQENNRWFLA GVTSFGYKCALPNRP 



* * ****** 



Trypsin GVYTKVYNYVKWIKNTIAANS- 
a-Thrombin GFYTHVFRLKKWIQKVIDQFGE 
Enteropeptidase GVYARVSRFTEWIQSFLH 

* * * ** 
Fig. 3 
sub 
furin 
PC_SK1 
PC_SK5 



IAHEYAQSV — PY GISQ— IKAPALHSQGY 

VAKRRAKRD — VYQEPTDPKFPQQWYLSGVTQRDLNVKEAWAQGF 

ALNL — FNDPMWNQQWYLQDTRMTAALPKLDL 



EKERSKRS ALRDS - 



NTHPCQ 



-SD — MNIEGAWKRGY- 



sub -TGSNVKVAVIDSGIDSSHPDL — NVRGGAS FVPSETN P 

furin TGHGIVVSILDDGIEKNHPDLAGNYDPGAS FDVNDQD PDPQ 

PC_SK1 HVIPVWQKGITGKGWITVLDDGLEWNHTDIYANYDPEASYDFNDNDHD P 

PC_SK5 — TGKNIWTILDDGIERTHPDL MQNYDA- -LASCDVNGNDLDPMP 

* * * * 3 , 



sub YQ DGSS HGTHVAGTI A— — AL— NNS IGVLGVSPSASLYAVKVLDS 

furin PRYTQM NDNR HGTRCAGEVA AVANN G VC G VG VA YNAR I G GVRMLD 

PC_SK1 FPRYDP TNENK HG TRC AGE I AMQ AN— NHKC G V— G VA YN S K VG G I RMLD G 

PC_SK5 RY DASNENKHGTRCAGEVA AAANNS HC T VG I AF NAKI G GVRMLD GDVTD 

■ 4 — ******* * , 



-GPT — GSTA LKT— - 

-GPEDDGKTVDGPARLAEE 



sub -TGSGQYSWI INGIE-WAI SNNMDVINMSLG 

furin GEVTDAVE ARS -LGLNPNH I HI YS AS W — 

PC_SK1 — IVTDAIEAS SIGFN PGHVD IYSASWGPNDDGKTVEGP GRLA QKAFE 

PC_SK5 MVEAKSVSFNPQHVHIYSASWGPDDDGKTVD GPA — PLT RQ 

"5 6 7 8- 



sub — VVDKAVSSG — IVVAAAAGNEGS S — GS TSTVGYP AKYP S T I AVGAV 

furin AFFRGVSQGRGGLGSIFVWASGNGGREHDSCNCDGYTNSI-YTLSISSATQFGNV 

PC_SK1 YGVKQGRQGKG S I F VWAS GNG GRQ GDNCDCD GYTDSI YTISI 

PC_SK5 — AFENGVRMGRRGLGSVFVWASGNGGRSKDHCS CDGYTWS I-YTI S I S S TAES GKKPWY 
8 * 9 



sub — N SSNQR ASFSSAG— SELDVMAPGVSIQSTLPGGTYGAY 

furin — PWYSEACSSTLA TTYSSGNQNEKQIVTTDLRQKCT ESH 

PC_SK1 — S SASQQGLSPWYAEKCSSTLATS YSSG— DYTDQRITSADLHNDCT ETH 

PC_SK5 LEE— CSSTL ATT YSSG -ES YDKKI ITTDLRQRCTDNH 



sub NGTSMATPHVAGAAALIL — SKHP — TWTN AQ VRDRL E S TAT Y — LG-NSFYYGKGLINV 
furin TGTSASAPLAAGIIALTLEANKNL — TWRDMQHLWQTSKPAH — LN— ADDWATNGVGRK 
PC_SK1 TGTS AS APLAAG IF ALAL — EANP — NLTWRDMQHLWWTSEYDPLA-NNPGWKKNGAGL 
PC_SK5 TGTSASAPMAAGIIALAL — EANPFLTWRDVQHVIVRTSRAGH — LNAND WKTN AAGFK V 
TLVDEQP LENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSL — ACTN 

EMVDN LRGK S GQ G Y Y VEMT VG SPPQTLNI LVD T G S S NF AVG A AP H P F L 

PAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTWFDTGSSNLWVPSIHCKLLDIACWI 

* 1 * * * * ****** * 2 



HNRFNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQV— G GISDTN 

HRY YQRQLS S TYRDLRKGV YVP YTQGKWEGELGTDLVS I PHGP NVTVRA 

HHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALG GVKVER 

- **** 3 * * * , _4 



QIFGLSETEPGSFLYYAPFDGILGLAYPSIS — S SGATP VFDN I WNQGLVS QDLF S V YL S 
NIAAITESDK-FFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVP-NLFSLQLC 
QWGEATKQPGITFIAAKFDGILGMAYPRIS — VNNVLP VFDNLMQQRLVD QN IF SF YL S 
5 **** 5 ** * * ** * 



ADD KS — GSWIFGGIDSS YYTGSLNWVPVTVEGYWQITVD SITMNGETI 

GAGFPLNQSEVLASV — GGSMI I GGIDHS L YTGS LWYTP IRRE WYYEVT I VRVE INGQDL 

RDP DAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT 

■ 7 ** * * *** „g 



A — CAEGC — QAIVDTGTSLLTGPTSPIANIQSDIGASENSD— GDMWSCSAI 

KMD CKE YN YDKS I VD S G T TN LRLP KKVF E AAVK S I KAA S S T EKFP D GF WLGE QL V— C W Q A 
L — CKE'GC — EAI VDTGT S LMVGPVDEVRELQKAI GAVP LI Q GEYMIPCEKV 

* * *** ** * * * , C) * 



SSLPDIVFTI NGVQYPVPPSAYILQSEGS CISGFQGMNVP— TESG 

GTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDV ATS QDDC YKFAI SQSS 

STLPAITLKL GGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP—PP SG 

* — 10 * * 11 



ELWILGDVFIRQYFTVFDRANNQVGLAPVA 
T GTVMGAVI ME GF YVVFDRARKRI GFAVSA 
P LW I LGDVF I GRY YT VFDRDNNRVGF AE AA 
— * * **** * * * 
Fig. 7 
01 MLEADDQGCI EEQGVEDSAN EDSVDAKPDR SSFVPSLFSK KKKNVTMRSI KTTRDRVPTY 



61 QYNMNFEKLG KCIIINNKNF DKVTGMGVRN GTDKDAEALF KCFRSLGFDV IVYNDCSCAK 

— 1 

121 MQDLLKKASE EDHTNAACFA CILLSHGEEN VIYGKDGVTP IKDLTAHFRG DRSKTLLEKP 

2 

181 KLFFIQACRG TELDDGIQAD SGPINDTDAN PRYKIPVEAD FLFAYSTVPG YYSWRSPGRG 

241 SWFVQALCSI LEEHGKDLEI MQILTRVNDR VARHFESQSD DPHFHEKKQI PCVVSMLTKE 
5 

301 LYFSQ 



Fig. 8 
Fig. 12 
trypsin SDR2 SDR1 & SDR2 
Fig. 16 
SEQUENCE LISTING 



<110> DIREVO Biotech AG 

<120> NEW BIOLOGICAL ENTITIES AND THE USE THEREOF 

<130> 0414 81wo/JH/ml 

<160> 126 

<170> PatentIn version 3.1 



<210> 1 

<211> 224 

<212> PRT 

<213> Homo sapiens 



<400> 1 



Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Lys Ser Arg Ile Gln Val
        35                  40                  45

Arg Leu Gly Glu His Asn Ile Glu Val Leu Glu Gly Asn Glu Gln Phe
    50                  55                  60

Ile Asn Ala Ala Lys Ile Ile Arg His Pro Gln Tyr Asp Arg Lys Thr
65                  70                  75                  80

Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Ser Ser Arg Ala Val Ile
                85                  90                  95

Asn Ala Arg Val Ser Thr Ile Ser Leu Pro Thr Ala Pro Pro Ala Thr
            100                 105                 110

Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn Thr Ala Ser Ser Gly
        115                 120                 125

Ala Asp Tyr Pro Asp Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser
    130                 135                 140

Gln Ala Lys Cys Glu Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met
145                 150                 155                 160

Phe Cys Val Gly Phe Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp
                165                 170                 175

Ser Gly Gly Pro Val Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser
            180                 185                 190

Trp Gly Asp Gly Cys Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys
        195                 200                 205

Val Tyr Asn Tyr Val Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
    210                 215                 220











<210> 2 

<211> 235 

<212> PRT 

<213> artificial sequence 
<220> 

<223> trypsin variant 1 



<400> 2 



Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Asp Ala Val Gly Arg Asp
        35                  40                  45

Lys Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu
    50                  55                  60

Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro
65                  70                  75                  80

Gln Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu
                85                  90                  95

Ser Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro
            100                 105                 110

Thr Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly
        115                 120                 125

Asn Thr Ile Thr Asn Ser Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp
    130                 135                 140

Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu
145                 150                 155                 160

Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe
                165                 170                 175

Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val
            180                 185                 190

Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys
        195                 200                 205

Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val
    210                 215                 220

Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230                 235



<210> 3 

<211> 235 

<212> PRT 

<213> artificial sequence 
<220> 

<223> trypsin variant 2 

<400> 3 



Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Asn Gly Arg Asp Leu Glu
        35                  40                  45

Lys Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu
    50                  55                  60

Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro
65                  70                  75                  80

Gln Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu
                85                  90                  95

Ser Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro
            100                 105                 110

Thr Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly
        115                 120                 125

Asn Val Arg Gly Thr Trp Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp
    130                 135                 140

Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu
145                 150                 155                 160

Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe
                165                 170                 175

Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val
            180                 185                 190

Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys
        195                 200                 205

Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val
    210                 215                 220

Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230                 235
WO 2004/113522 

PCT/EP2004/051173 



<210> 4 

<211> 235 

<212> PRT 

<213> artificial sequence 
<220> 

<223> trypsin variant 3 



<400> 4 



Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Ala Ala Thr Asn Gly Asp
        35                  40                  45

Lys Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu
    50                  55                  60

Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro
65                  70                  75                  80

Gln Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu
                85                  90                  95

Ser Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro
            100                 105                 110

Thr Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly
        115                 120                 125

Asn Arg Lys Asp Phe Trp Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp
    130                 135                 140

Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu
145                 150                 155                 160

Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe
                165                 170                 175

Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val
            180                 185                 190

Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys
        195                 200                 205

Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val
    210                 215                 220

Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230                 235



<210> 5 

<211> 259 

<212> PRT 

<213> Homo sapiens 



<400> 5 



Ile Val Glu Gly Ser Asp Ala Glu Ile Gly Met Ser Pro Trp Gln Val
1               5                   10                  15

Met Leu Phe Arg Lys Ser Pro Gln Glu Leu Leu Cys Gly Ala Ser Leu
            20                  25                  30

Ile Ser Asp Arg Trp Val Leu Thr Ala Ala His Cys Leu Leu Tyr Pro
        35                  40                  45

Pro Trp Asp Lys Asn Phe Thr Glu Asn Asp Leu Leu Val Arg Ile Gly
    50                  55                  60

Lys His Ser Arg Thr Arg Tyr Glu Arg Asn Ile Glu Lys Ile Ser Met
65                  70                  75                  80

Leu Glu Lys Ile Tyr Ile His Pro Arg Tyr Asn Trp Arg Glu Asn Leu
                85                  90                  95

Asp Arg Asp Ile Ala Leu Met Lys Leu Lys Lys Pro Val Ala Phe Ser
            100                 105                 110

Asp Tyr Ile His Pro Val Cys Leu Pro Asp Arg Glu Thr Ala Ala Ser
        115                 120                 125

Leu Leu Gln Ala Gly Tyr Lys Gly Arg Val Thr Gly Trp Gly Asn Leu
    130                 135                 140

Lys Glu Thr Trp Thr Ala Asn Val Gly Lys Gly Gln Pro Ser Val Leu
145                 150                 155                 160

Gln Val Val Asn Leu Pro Ile Val Glu Arg Pro Val Cys Lys Asp Ser
                165                 170                 175

Thr Arg Ile Arg Ile Thr Asp Asn Met Phe Cys Ala Gly Tyr Lys Pro
            180                 185                 190

Asp Glu Gly Lys Arg Gly Asp Ala Cys Glu Gly Asp Ser Gly Gly Pro
        195                 200                 205

Phe Val Met Lys Ser Pro Phe Asn Asn Arg Trp Tyr Gln Met Gly Ile
    210                 215                 220

Val Ser Trp Gly Glu Gly Cys Asp Arg Asp Gly Lys Tyr Gly Phe Tyr
225                 230                 235                 240

Thr His Val Phe Arg Leu Lys Lys Trp Ile Gln Lys Val Ile Asp Gln
                245                 250                 255

Phe Gly Glu

























<210> 6 

<211> 235 

<212> PRT 

<213> Homo sapiens 



<400> 6 



Ile Val Gly Gly Ser Asn Ala Lys Glu Gly Ala Trp Pro Trp Val Val
1               5                   10                  15

Gly Leu Tyr Tyr Gly Gly Arg Leu Leu Cys Gly Ala Ser Leu Val Ser
            20                  25                  30

Ser Asp Trp Leu Val Ser Ala Ala His Cys Val Tyr Gly Arg Asn Leu
        35                  40                  45

Glu Pro Ser Lys Trp Thr Ala Ile Leu Gly Leu His Met Lys Ser Asn
    50                  55                  60

Leu Thr Ser Pro Gln Thr Val Pro Arg Leu Ile Asp Glu Ile Val Ile
65                  70                  75                  80

Asn Pro His Tyr Asn Arg Arg Arg Lys Asp Asn Asp Ile Ala Met Met
                85                  90                  95

His Leu Glu Phe Lys Val Asn Tyr Thr Asp Tyr Ile Gln Pro Ile Cys
            100                 105                 110

Leu Pro Glu Glu Asn Gln Val Phe Pro Pro Gly Arg Asn Cys Ser Ile
        115                 120                 125

Ala Gly Trp Gly Thr Val Val Tyr Gln Gly Thr Thr Ala Asn Ile Leu
    130                 135                 140

Gln Glu Ala Asp Val Pro Leu Leu Ser Asn Glu Arg Cys Gln Gln Gln
145                 150                 155                 160

Met Pro Glu Tyr Asn Ile Thr Glu Asn Met Ile Cys Ala Gly Tyr Glu
                165                 170                 175

Glu Gly Gly Ile Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu Met
            180                 185                 190

Cys Gln Glu Asn Asn Arg Trp Phe Leu Ala Gly Val Thr Ser Phe Gly
        195                 200                 205

Tyr Lys Cys Ala Leu Pro Asn Arg Pro Gly Val Tyr Ala Arg Val Ser
    210                 215                 220

Arg Phe Thr Glu Trp Ile Gln Ser Phe Leu His
225                 230                 235













<210> 7 

<211> 275 

<212> PRT 

<213> Bacillus subtilis 
<400> 7 

Ile Ala His Glu Tyr Ala Gln Ser Val Pro Tyr Gly Ile Ser Gln Ile
1               5                   10                  15

Lys Ala Pro Ala Leu His Ser Gln Gly Tyr Thr Gly Ser Asn Val Lys
            20                  25                  30

Val Ala Val Ile Asp Ser Gly Ile Asp Ser Ser His Pro Asp Leu Asn
        35                  40                  45

Val Arg Gly Gly Ala Ser Phe Val Pro Ser Glu Thr Asn Pro Tyr Gln
    50                  55                  60

Asp Gly Ser Ser His Gly Thr His Val Ala Gly Thr Ile Ala Ala Leu
65                  70                  75                  80

Asn Asn Ser Ile Gly Val Leu Gly Val Ser Pro Ser Ala Ser Leu Tyr
                85                  90                  95

Ala Val Lys Val Leu Asp Ser Thr Gly Ser Gly Gln Tyr Ser Trp Ile
            100                 105                 110

Ile Asn Gly Ile Glu Trp Ala Ile Ser Asn Asn Met Asp Val Ile Asn
        115                 120                 125

Met Ser Leu Gly Gly Pro Thr Gly Ser Thr Ala Leu Lys Thr Val Val
    130                 135                 140

Asp Lys Ala Val Ser Ser Gly Ile Val Val Ala Ala Ala Ala Gly Asn
145                 150                 155                 160

Glu Gly Ser Ser Gly Ser Thr Ser Thr Val Gly Tyr Pro Ala Lys Tyr
                165                 170                 175

Pro Ser Thr Ile Ala Val Gly Ala Val Asn Ser Ser Asn Gln Arg Ala
            180                 185                 190

Ser Phe Ser Ser Ala Gly Ser Glu Leu Asp Val Met Ala Pro Gly Val
        195                 200                 205

Ser Ile Gln Ser Thr Leu Pro Gly Gly Thr Tyr Gly Ala Tyr Asn Gly
    210                 215                 220

Thr Ser Met Ala Thr Pro His Val Ala Gly Ala Ala Ala Leu Ile Leu
225                 230                 235                 240

Ser Lys His Pro Thr Trp Thr Asn Ala Gln Val Arg Asp Arg Leu Glu
                245                 250                 255

Ser Thr Ala Thr Tyr Leu Gly Asn Ser Phe Tyr Tyr Gly Lys Gly Leu
            260                 265                 270

Ile Asn Val
        275



<210> 8 

<211> 320 

<212> PRT 

<213> Murinae gen. sp. 



<400> 8 



Val Ala Lys Arg Arg Ala Lys Arg Asp Val Tyr Gln Glu Pro Thr Asp
1               5                   10                  15

Pro Lys Phe Pro Gln Gln Trp Tyr Leu Ser Gly Val Thr Gln Arg Asp
            20                  25                  30

Leu Asn Val Lys Glu Ala Trp Ala Gln Gly Phe Thr Gly His Gly Ile
        35                  40                  45

Val Val Ser Ile Leu Asp Asp Gly Ile Glu Lys Asn His Pro Asp Leu
    50                  55                  60

Ala Gly Asn Tyr Asp Pro Gly Ala Ser Phe Asp Val Asn Asp Gln Asp
65                  70                  75                  80

Pro Asp Pro Gln Pro Arg Tyr Thr Gln Met Asn Asp Asn Arg His Gly
                85                  90                  95

Thr Arg Cys Ala Gly Glu Val Ala Ala Val Ala Asn Asn Gly Val Cys
            100                 105                 110

Gly Val Gly Val Ala Tyr Asn Ala Arg Ile Gly Gly Val Arg Met Leu
        115                 120                 125

Asp Gly Glu Val Thr Asp Ala Val Glu Ala Arg Ser Leu Gly Leu Asn
    130                 135                 140
Pro Asn His Ile 
145 

Gly Lys Thr Val 

Arg Gly Val Ser 
180 

Ala Ser Gly Asn 
195 

Tyr Thr Asn Ser 
210 

Gly Asn Val Pro 
225 

Thr Tyr Ser Ser 

Leu Arg Gin Lys 
260 

Pro Leu Ala Ala 
275 

Leu Thr Trp Arg 
290 

Ala His Leu Asn 
10 










15 




Asp Ser Ala Asn Glu Asp Ser Val Asp Ala Lys Pro Asp Arg Ser Ser
            20                  25                  30

Phe Val Pro Ser Leu Phe Ser Lys Lys Lys Lys Asn Val Thr Met Arg
        35                  40                  45

Ser Ile Lys Thr Thr Arg Asp Arg Val Pro Thr Tyr Gln Tyr Asn Met
    50                  55                  60

Asn Phe Glu Lys Leu Gly Lys Cys Ile Ile Ile Asn Asn Lys Asn Phe
65                  70                  75                  80

Asp Lys Val Thr Gly Met Gly Val Arg Asn Gly Thr Asp Lys Asp Ala
                85                  90                  95

Glu Ala Leu Phe Lys Cys Phe Arg Ser Leu Gly Phe Asp Val Ile Val
            100                 105                 110

Tyr Asn Asp Cys Ser Cys Ala Lys Met Gln Asp Leu Leu Lys Lys Ala
        115                 120                 125

Ser Glu Glu Asp His Thr Asn Ala Ala Cys Phe Ala Cys Ile Leu Leu
    130                 135                 140

Ser His Gly Glu Glu Asn Val Ile Tyr Gly Lys Asp Gly Val Thr Pro
145                 150                 155                 160

Ile Lys Asp Leu Thr Ala His Phe Arg Gly Asp Arg Ser Lys Thr Leu
                165                 170                 175

Leu Glu Lys Pro Lys Leu Phe Phe Ile Gln Ala Cys Arg Gly Thr Glu
            180                 185                 190

Leu Asp Asp Gly Ile Gln Ala Asp Ser Gly Pro Ile Asn Asp Thr Asp
        195                 200                 205

Ala Asn Pro Arg Tyr Lys Ile Pro Val Glu Ala Asp Phe Leu Phe Ala
    210                 215                 220

Tyr Ser Thr Val Pro Gly Tyr Tyr Ser Trp Arg Ser Pro Gly Arg Gly
225                 230                 235                 240

Ser Trp Phe Val Gln Ala Leu Cys Ser Ile Leu Glu Glu His Gly Lys
                245                 250                 255

Asp Leu Glu Ile Met Gln Ile Leu Thr Arg Val Asn Asp Arg Val Ala
            260                 265                 270

Arg His Phe Glu Ser Gln Ser Asp Asp Pro His Phe His Glu Lys Lys
        275                 280                 285

Gln Ile Pro Cys Val Val Ser Met Leu Thr Lys Glu Leu Tyr Phe Ser
    290                 295                 300

Gln
305



<210> 15 
<211> 262 
<212> PRT 

<213> Streptomyces sp. K15 
<400> 15 

Val Thr Lys Pro Thr Ile Ala Ala Val Gly Gly Tyr Ala Met Asn Asn
1               5                   10                  15

Gly Thr Gly Thr Thr Leu Tyr Thr Lys Ala Ala Asp Thr Arg Arg Ser
            20                  25                  30

Thr Gly Ser Thr Thr Lys Ile Met Thr Ala Lys Val Val Leu Ala Gln
        35                  40                  45

Ser Asn Leu Asn Leu Asp Ala Lys Val Thr Ile Gln Lys Ala Tyr Ser
    50                  55                  60

Asp Tyr Val Val Ala Asn Asn Ala Ser Gln Ala His Leu Ile Val Gly
65                  70                  75                  80

Asp Lys Val Thr Val Arg Gln Leu Leu Tyr Gly Leu Met Leu Pro Ser
                85                  90                  95

Gly Cys Asp Ala Ala Tyr Ala Leu Ala Asp Lys Tyr Gly Ser Gly Ser
            100                 105                 110

Thr Arg Ala Ala Arg Val Lys Ser Phe Ile Gly Lys Met Asn Thr Ala
        115                 120                 125

Ala Thr Asn Leu Gly Leu His Asn Thr His Phe Asp Ser Phe Asp Gly
    130                 135                 140

Ile Gly Asn Gly Ala Asn Tyr Ser Thr Pro Arg Asp Leu Thr Lys Ile
145                 150                 155                 160

Ala Ser Ser Ala Met Lys Asn Ser Thr Phe Arg Thr Val Val Lys Thr
                165                 170                 175

Lys Ala Tyr Thr Ala Lys Thr Val Thr Lys Thr Gly Ser Ile Arg Thr
            180                 185                 190

Met Asp Thr Trp Lys Asn Thr Asn Gly Leu Leu Ser Ser Tyr Ser Gly
        195                 200                 205

Ala Ile Gly Val Lys Thr Gly Ser Gly Pro Glu Ala Lys Tyr Cys Leu
    210                 215                 220

Val Phe Ala Ala Thr Arg Gly Gly Lys Thr Val Ile Gly Thr Val Leu
225                 230                 235                 240

Ala Ser Thr Ser Ile Pro Ala Arg Glu Ser Asp Ala Thr Lys Ile Met
                245                 250                 255

Asn Tyr Gly Phe Ala Leu
            260



<210> 16 
<211> 256 
<212> PRT 

<213> Human cytomegalovirus 
<400> 16 

Met Thr Met Asp Glu Gln Gln Ser Gln Ala Val Ala Pro Val Tyr Val
1               5                   10                  15

Gly Gly Phe Leu Ala Arg Tyr Asp Gln Ser Pro Asp Glu Ala Glu Leu
            20                  25                  30

Leu Leu Pro Arg Asp Val Val Glu His Trp Leu His Ala Gln Gly Gln
        35                  40                  45

Gly Gln Pro Ser Leu Ser Val Ala Leu Pro Leu Asn Ile Asn His Asp
    50                  55                  60

Asp Thr Ala Val Val Gly His Val Ala Ala Met Gln Ser Val Arg Asp
65                  70                  75                  80

Gly Leu Phe Cys Leu Gly Cys Val Thr Ser Pro Arg Phe Leu Glu Ile
                85                  90                  95

Val Arg Arg Ala Ser Glu Lys Ser Glu Leu Val Ser Arg Gly Pro Val
            100                 105                 110

Ser Pro Leu Gln Pro Asp Lys Val Val Glu Phe Leu Ser Gly Ser Tyr
        115                 120                 125

Ala Gly Leu Ser Leu Ser Ser Arg Arg Cys Asp Asp Val Glu Gln Ala
    130                 135                 140

Thr Ser Leu Ser Gly Ser Glu Thr Thr Pro Phe Lys His Val Ala Leu
145                 150                 155                 160

Cys Ser Val Gly Arg Arg Arg Gly Thr Leu Ala Val Tyr Gly Arg Asp
                165                 170                 175

Pro Glu Trp Val Thr Gln Arg Phe Pro Asp Leu Thr Ala Ala Asp Arg
            180                 185                 190

Asp Gly Leu Arg Ala Gln Trp Gln Arg Cys Gly Ser Thr Ala Val Asp
        195                 200                 205

Ala Ser Gly Asp Pro Phe Arg Ser Asp Ser Tyr Gly Leu Leu Gly Asn
    210                 215                 220

Ser Val Asp Ala Leu Tyr Ile Arg Glu Arg Leu Pro Lys Leu Arg Tyr
225                 230                 235                 240

Asp Lys Gln Leu Val Gly Val Thr Glu Arg Glu Ser Tyr Val Lys Ala
                245                 250                 255



<210> 17 

<211> 248 

<212> PRT 

<213> Escherichia coli 



<400> 17 



Val Arg Ser Phe Ile Tyr Glu Pro Phe Gln Ile Pro Ser Gly Ser Met
1               5                   10                  15

Met Pro Thr Leu Leu Ile Gly Asp Phe Ile Leu Val Glu Lys Phe Ala
            20                  25                  30

Tyr Gly Ile Lys Asp Pro Ile Tyr Gln Lys Thr Leu Ile Glu Thr Gly
        35                  40                  45

His Pro Lys Arg Gly Asp Ile Val Val Phe Lys Tyr Pro Glu Asp Pro
    50                  55                  60

Lys Leu Asp Tyr Ile Lys Arg Ala Val Gly Leu Pro Gly Asp Lys Val
65                  70                  75                  80

Thr Tyr Asp Pro Val Ser Lys Glu Leu Thr Ile Gln Pro Gly Cys Ser
                85                  90                  95

Ser Gly Gln Ala Cys Glu Asn Ala Leu Pro Val Thr Tyr Ser Asn Val
            100                 105                 110

Glu Pro Ser Asp Phe Val Gln Thr Phe Ser Arg Arg Asn Gly Gly Glu
        115                 120                 125

Ala Thr Ser Gly Phe Phe Glu Val Pro Lys Asn Glu Thr Lys Glu Asn
    130                 135                 140

Gly Ile Arg Leu Ser Glu Arg Lys Glu Thr Leu Gly Asp Val Thr His
145                 150                 155                 160

Arg Ile Leu Thr Val Pro Ile Ala Gln Asp Gln Val Gly Met Tyr Tyr
                165                 170                 175

Gln Gln Pro Gly Gln Gln Leu Ala Thr Trp Ile Val Pro Pro Gly Gln
            180                 185                 190

Tyr Phe Met Met Gly Asp Asn Arg Asp Asn Ser Ala Asp Ser Arg Tyr
        195                 200                 205

Trp Gly Phe Val Pro Glu Ala Asn Leu Val Gly Arg Ala Thr Ala Ile
    210                 215                 220

Trp Met Ser Phe Asp Lys Gln Glu Gly Glu Trp Pro Thr Gly Leu Arg
225                 230                 235                 240

Leu Ser Arg Ile Gly Gly Ile His
                245



<210> 18 

<211> 317 

<212> PRT 

<213> Serratia marcescens 



<400> 18 



Met Glu 


Gin 


Leu 


Arg 


Gly 


Leu 


Tyr 


Pro 


Pro 


Leu 


Ala 


Ala 


Tyr 


Asp 


Ser 


1 






5 










10 










15 




Gly Trp 


Leu 


Asp 


Thr 


Giy 


Asp 


Gly 


His 


Arg 


He 


Tyr 


Trp 


Glu 


Leu 


Ser 






20 










25 










30 






Gly Asn 


Pro 


Asn 


Gly 


Lys 


Pro 


Ala 


Val 


Phe 


Ile


His 


Gly 


Gly 


Pro 


Gly 




35 










40 










45 








Gly Gly 


Ile


Ser 


Pro 


His 


His 


Arg 


Gln


Leu 


Phe 


Asp 


Pro 


Glu 


Arg 


Tyr 


50 










55 










60 










Lys Val 


Leu 


Leu 


Phe 


Asp 


Gln


Arg 


Gly 


Cys 


Gly 


Arg 


Ser 


Arg 


Pro 


His 


65 








70 










75 










80 


Ala Ser 


Leu 


Asp 


Asn 


Asn 


Thr 


Thr 


Trp 


His 


Leu 


Val 


Ala 


Asp 


Ile


Glu 








85 










90 










95 




Arg Leu 


Arg 


Glu 


Met 


Ala 


Gly 


Val 


Glu 


Gln


Trp 


Leu 


Val 


Phe 


Gly 


Gly 






100 










105 










110 






Ser Trp 


Gly 


Ser 


Thr 


Leu 


Ala 


Leu 


Ala 


Tyr 


Ala 


Gln


Thr 


His 


Pro 


Glu 




115 










120 










125 








Arg Val 


Ser 


Glu 


Met 


Val 


Leu 


Arg 


Gly 


Ile


Phe 


Thr 


Leu 


Arg 


Lys 


Gln


130 










135 










140 










Arg Leu 


His 


Trp 


Tyr 


Tyr 


Gln


Asp 


Gly 


Ala 


Ser 


Arg 


Phe 


Phe 


Pro 


Glu 


145 








150 










155 










160 


Lys Trp 


Glu 


Arg 


Val 


Leu 


Ser 


Ile


Leu 


Ser 


Asp 


Asp 


Glu 


Arg 


Lys 


Asp 








165 










170 










175 




Val Ile


Ala 


Ala 


Tyr 


Arg 


Gln


Arg 


Leu 


Thr 


Ser 


Ala 


Asp 


Pro 


Gln


Val 






180 










185 










190






Gln Leu


Glu 


Ala 


Ala 


Lys 


Leu 


Trp 


Ser 


Val 


Trp 


Glu 


Gly 


Glu 


Thr 


Val 




195 










200 










205 








Thr Leu 


Leu 


Pro 


Ser 


Arg 


Glu 


Ser 


Ala 


Ser 


Phe 


Gly 


Glu 


Asp 


Asp 


Phe 


210 










215 










220










Ala Leu 


Ala 


Phe 


Ala 


Arg 


Ile


Glu 


Asn 


His 


Tyr 


Phe 


Thr 


His 


Leu 


Gly 


225 








230 










235 










240 


Phe Leu 


Glu 


Ser 


Asp 


Asp 


Gln


Leu 


Leu 


Arg 


Asn 


Val 


Pro 


Leu 


Ile


Arg 








245 










250 










255 




His Ile


Pro 


Ala 


Val 


Ile


Val 


His 


Gly 


Arg 


Tyr 


Asp 


Met 


Ala 


Cys 


Gln






260 










265 










270 






Val Gln


Asn 


Ala 


Trp 


Asp 


Leu 


Ala 


Lys 


Ala 


Trp 


Pro 


Glu 


Ala 


Glu 


Leu 




275 










280 










285 








His Ile


Val 


Glu 


Gly Ala 


Gly 


His 


Ser 


Tyr 


Asp 


Glu 


Pro 


Gly 


Ile


Leu 


290 










295 










300 










His Gln


Leu 


Met 


Ile


Ala 


Thr 


Asp 


Arg 


Phe 


Ala 


Gly 


Lys 








305 








310 










315 












<210> 19

<211> 229

<212> PRT

<213> Escherichia coli



<400> 19



Met Glu 


Leu 


Leu 


Leu 


Leu 


Ser 


Asn 


Ser 


Thr 


Leu 


Pro 


Gly 


Lys 


Ala 


Trp 


1 






5 










10 










15 




Leu Glu 


His 


Ala 


Leu 


Pro 


Leu 


Ile


Ala 


Asn 


Gln


Leu 


Asn 


Gly 


Arg 


Arg 






20 










25 










30 
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Ser Ala 


Val 


Phe 


Ile


Pro 


Phe 


Ala 


Gly 


Val 


Thr 


Gln


Thr 


Trp 


Asp 


Glu 




35 










40 










45 








Tyr Thr 


Asp 


Lys 


Thr 


Ala 


Glu 


Val 


Leu 


Ala 


Pro 


Leu 


Gly Val 


Asn 


Val 


50 










55 










60 










Thr Gly 


Ile


His 


Arg 


Val 


Ala 


Asp 


Pro 


Leu 


Ala 


Ala 


Ile


Glu 


Lys 


Ala 


65 








70 










75 










80 


Glu Ile


Ile


Ile


Val 


Gly 


Gly 


Gly 


Asn 


Thr 


Phe 


Gln


Leu 


Leu 


Lys 


Glu 








85 










90 










95 




Ser Arg 


Glu 


Arg 


Gly 


Leu 


Leu 


Ala 


Pro 


Met 


Ala 


Asp 


Arg 


Val 


Lys 


Arg 






100 










105 










110 






Gly Ala 


Leu 


Tyr 


Ile


Gly 


Trp 


Ser 


Ala 


Gly Ala 


Asn 


Leu 


Ala 


Cys 


Pro 




115 










120 










125 








Thr Ile


Arg 


Thr 


Thr 


Asn 


Asp 


Met 


Pro 


Ile


Val 


Asp 


Pro 


Asn 


Gly 


Phe 


130










135 










140 










Asp Ala 


Leu 


Asp 


Leu 


Phe 


Pro 


Leu 


Gln


Ile


Asn 


Pro 


His 


Phe 


Thr 


Asn 


145 








150










155 










160 


Ala Leu 


Pro 


Glu 


Gly 


His 


Lys 


Gly 


Glu 


Thr 


Arg 


Glu 


Gln


Arg 


Ile


Arg 








165 










170 










175 




Glu Leu 


Leu 


Val 


Val 


Ala 


Pro 


Glu 


Leu 


Thr 


Val 


Ile


Gly 


Leu 


Pro 


Glu 






180 










185 










190 






Gly Asn 


Trp 


Ile


Gln


Val 


Ser 


Asn 


Gly 


Gln


Ala 


Val 


Leu 


Gly 


Gly 


Pro 




195 










200 










205








Asn Thr 


Thr 


Trp 


Val 


Phe 


Lys 


Ala 


Gly 


Glu 


Glu 


Ala 


Val 


Ala 


Leu 


Glu 


210 










215 










220 










Ala Gly 


His 


Arg 


Phe 
























225 






























<210> 20

<211> 99

<212> PRT

<213> Human immunodeficiency virus



<400> 20



Pro Gln


Ile


Thr 


Leu 


Trp 


Gln


Arg 


Pro 


Leu 


Val 


Thr 


Val 


Lys 


Ile


Gly 


1 






5 










10










15 




Gly Gln


Leu 


Arg 


Glu 


Ala 


Leu 


Leu 


Asp 


Thr 


Gly Ala Asp Asp 


Thr 


Val 






20 










25 










30 






Leu Glu 


Asp 


Ile


Asn 


Leu 


Pro 


Gly Lys 


Trp 


Lys 


Pro 


Lys 


Met 


Ile


Gly 




35 










40 










45 








Gly Ile


Gly 


Gly 


Phe 


Ile


Lys 


Val 


Arg 


Gln


Tyr 


Asp 


Gln


Ile


Leu 


Ile


50 










55 










60










Glu Ile


Cys 


Gly 


Lys 


Lys 


Ala 


Ile


Gly 


Thr 


Val 


Leu 


Val 


Gly 


Pro 


Thr 


65 








70 










75 










80 


Pro Val 


Asn 


Ile


Ile


Gly 


Arg 


Asn 


Met 


Leu 


Thr 


Gln


Ile


Gly 


Cys 


Thr 








85 










90 










95 




Leu Asn 


Phe 




























<210> 21

<211> 297

<212> PRT

<213> Escherichia coli



<400> 21



Ser Thr 


Glu 


Thr 


Leu 


Ser 


Phe 


Thr 


Pro 


Asp 


Asn 


Ile


Asn 


Ala 


Asp 


Ile


1 






5 










10 










15 




Ser Leu 


Gly 


Thr 


Leu 


Ser 


Gly Lys 


Thr 


Lys 


Glu 


Arg 


Val 


Tyr 


Leu 


Ala 






20 










25 










30 






Glu Glu 


Gly 


Gly 


Arg 


Lys 


Val 


Ser 


Gln


Leu 


Asp 


Trp 


Lys 


Phe 


Asn 


Asn 




35 










40 










45 








Ala Ala 


Ile


Ile


Lys 


Gly Ala 


Ile


Asn 


Trp 


Asp 


Leu 


Met 


Pro 


Gln


Ile
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50 










55 




Ser 


Ile


Gly Ala Ala 


Gly 


Trp 


Thr 


65 










70 






Met 


Val 


Asp 


Gln


Asp 


Trp 


Met 


Asp 










85 








Asp 


Glu 


Ala 


Arg 


His 


Pro 


Asp 


Thr 








100 










Asp 


Leu 


Asn 


Ile


Lys 


Gly 


Trp 


Leu 






115 










120 


Gly 


Leu 


Met 


Ala 


Gly 


Tyr 


Gln


Glu 




130 










135 




Gly Gly 


Ser 


Tyr 


Ile


Tyr 


Ser 


Ser 


145 










150 






Gly 


Ser 


Phe 


Pro 


Asn 


Gly 


Glu 


Arg 










165 








Lys 


Met 


Pro 


Tyr 


Ile


Gly 


Leu 


Thr 








180 










Glu 


Leu 


Gly Gly 


Thr 


Phe 


Lys 


Tyr 






195 










200 


Asn 


Asp 


Glu 


His 


Tyr 


Asp 


Pro 


Lys 




210 










215 




Val 


Lys 


Asp 


Gln


Asn 


Tyr 


Tyr 


Ser 


225 










230 






Val 


Thr 


Pro 


Asn 


Ala 


Lys 


Val 


Tyr 










245 








Thr 


Asn 


Lys 


Lys 


Gly Asn 


Thr 


Ser 








260 










Ser 


Asp 


Tyr 


Ser 


Lys 


Asn 


Gly Ala 






275 










280 


Thr 


Thr 


Ala 


Gly 


Leu 


Lys 


Tyr 


Thr 




290 










295 





<210> 22 

<211> 212 

<212> PRT 

<213> Carica papaya 



<400> 22 



Ile


Pro 


Glu 


Tyr 


Val 


Asp 


Trp 


Arg 


1 








5 








Lys 


Asn 


Gln


Gly 


Ser 


Cys 


Gly Ser 








20 










Thr 


Ile


Glu 


Gly 


Ile


Ile


Lys 


Ile






35 










40 


Ser 


Glu 


Gln


Glu 


Leu 


Leu 


Asp 


Cys 




50 










55 




Gly 


Gly 


Tyr 


Pro 


Trp 


Ser 


Ala 


Leu 


65 










70 






His 


Tyr 


Arg 


Asn 


Thr 
85 


Tyr 


Pro 


Tyr 


Ser 


Arg 


Glu 


Lys 


Gly 


Pro 


Tyr 


Ala 








100 










Val 


Gln


Pro 


Tyr 


Asn 


Gln


Gly Ala 






115 










120 


Pro 


Val 


Ser 


Val 


Val 


Leu 


Gln


Ala 




130 










135 




Arg 


Gly 


Gly 


Ile
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Arg 


Met 


Thr 


Thr 
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Asn 


Glu 


Val 


Asp 
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Leu 
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Lys 


Asn 


Asn 


Lys 


Gly 


Ile


Ala 


Ile


Leu 


Gly 


Ser 


Arg 


Val 


Glu 


Ser 
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Arg 


Asn 


Gly Met 
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Ala 


Gly 
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Ala 


Met 


Ala 


Val 


Val 


Gly 


Asn 


Ala 






115 










120 
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Lys 


Leu 


Asn 
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Gly 


Gln


Glu 


Val 


Ile


Ile


Ile


Trp 


Asn 


Pro 


Trp 


Asp 
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Met 


Thr 


Gln


Asp 


Ala 
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Asn 


Asn 


Val 


Ile
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Val 


Ser 


145 
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Trp 


Tyr 
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Ile


Tyr 
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Tyr 
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<213> Saccharomyces cerevisiae


















<400> 28 
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Val 


Gln
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Thr 
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Glu 


Phe 
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85 










90 










95 




Gln


Ile


Asp 


Lys 


Leu 


Asp 


Lys 


Ile
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Leu 
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Ile
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Ser 
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Asn 
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Met 


Ser 
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Ile
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140 










Leu 


Thr 


Asp 
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Gln


Lys 
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Val 
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Glu 
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T-Tn d 
riJ. to 


Thr 
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Ile
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<212> PRT 

<213> Pyrococcus horikoshii 
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Ile
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Met 


Ile






115 










120 
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Gly Val 
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Trp 


Val 


Asp 


Ala 


Glu 


Val 


Val 


Val 


Asp 


Gly 


Asn 
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Trp 


Val 


Ser 


Ser 


Arg 


Val 
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Ala 


Asp 


Leu 


Tyr 
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Trp 


Met 


Arg 
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145 
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Phe 


Val 
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Leu 


Lys 























165 



<210> 30 

<211> 316 

<212> PRT 

<213> Bacillus thermoproteolyticus
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Asn 
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Val 
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Gly 


Gln
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Thr 


Phe 


Ile
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Leu 


Ser 


Gly 


Gly 


Ile


Asp 


Val 


Val 


Ala 


His 


Glu 


Leu 
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140 










Thr 
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Ala 


Val 


Thr 


Asp 


Tyr 


Thr 


Ala 


Gly 


Leu 


Ile


Tyr 


Gln


Asn 


Glu 


145 










150 
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Ser 


Gly 


Ala 


Ile


Asn 


Glu 


Ala 


Ile


Ser 


Asp 


Ile


Phe 


Gly Thr 


Leu 


Val 










165 










170 
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Phe 


Tyr 


Ala 


Asn 


Lys 


Asn 


Pro 


Asp 


Trp 


Glu 


Ile


Gly Glu Asp 


Val 
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Thr 


Pro 


Gly 
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Ser 
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Ser 


Leu 


Arg 


Ser 


Met 


Ser 


Asp 


Pro 
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205 








Ala 


Lys 


Tyr 
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Asp 


Pro 


Asp 


His 


Tyr 


Ser 


Lys 


Arg 


Tyr 


Thr 


Gly 


Thr 
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Gln


Asp 


Asn 


Gly 


Gly 


Val 


His 


Ile


Asn 


Ser 


Gly 


Ile


Ile


Asn 


Lys 


Ala 
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225 










230 








235 










240 


Ala 


Tyr 


Leu 


Ile


Ser 


Gln Gly Gly Thr


His 


Tyr 


Gly Val 


Ser 


Val 


Val 
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250 
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Gly 


Ile


Gly Arg Asp 


Lys 


Leu 


Gly Lys 


Ile


Phe 


Tyr 


Arg 


Ala 


Leu 


Thr 








260 








265 










270 






Gln


Tyr 


Leu 


Thr 


Pro 


Thr 


Ser 


Asn Phe 


Ser 


Gln


Leu 


Arg 


Ala 


Ala 


Ala 






275 










280 








285 








Val 


Gln


Ser 


Ala 


Thr 


Asp 


Leu 


Tyr Gly 


Ser 


Thr 


Ser 


Gln


Glu 


Val 


Ala 




290 










295 








300 










Ser 


Val 


Lys 


Gln


Ala 


Phe 


Asp 


Ala Val 


Gly Val 


Lys 










305 










310 








315 













<210> 31 

<211> 169 

<212> PRT 

<213> Homo sapiens 



<400> 31 



Val 


Leu 


Thr 


Glu 


Gly 


Asn 


Pro 


Arg 


Trp 


Glu 


Gln


Thr 


His 


Leu 


Thr Tyr 


1 








5 










10 










15 


Arg 


Ile


Glu 


Asn 


Tyr 


Thr 


Pro 


Asp 


Leu 


Pro 


Arg 


Ala 


Asp 


Val 


Asp His 
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Ala 
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Glu 


Lys 


Ala 


Phe 


Gln


Leu 


Trp 


Ser 


Asn 


Val 


Thr 


Pro 


Leu Thr 






35 










40 
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Phe 


Thr 


Lys 


Val 


Ser 


Glu 


Gly Gln Ala


Asp 


Ile


Met 


Ile


Ser 


Phe Val 
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60 








Arg 


Gly Asp 


His 


Arg 


Asp 


Asn 


Ser 


Pro 


Phe 


Asp 


Gly 


Pro 


Gly 


Gly Asn 
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Leu 


Ala 
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Ala 


Phe 


Gln


Pro 


Gly 


Pro 


Gly 


Ile


Gly 


Gly 


Asp 


Ala His 










85 
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95 


Phe 


Asp 


Glu 


Asp 


Glu 


Arg 


Trp 


Thr 


Asn 


Asn 


Phe 


Arg 


Glu 


Tyr 


Asn Leu 
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105 










110 
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Arg 


Val 


Ala 


Ala 


His 


Glu 


Leu 


Gly 


His 


Ser 


Leu 


Gly 


Leu 


Ser His 
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120 










125 






Ser 


Thr 


Asp 


Ile


Gly 


Ala 


Leu 


Met 


Tyr 


Pro 


Ser 


Tyr 


Thr 


Phe 


Ser Gly 




130 










135 










140 








Asp 


Val 


Gln


Leu 


Ala 


Gln


Asp 


Asp 


Ile


Asp 


Gly 


Ile


Gln


Ala 


Ile Tyr


145 










150 










155 
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Gly 


Arg 


Ser 


Gln


Asn 


Pro 


Val 


Gln


Pro 















165 



<210> 32 

<211> 496 

<212> PRT 

<213> Homo sapiens 



<400> 32 



Gln


Tyr 


Ser 


Pro 


Asn 


Thr 


Gln


Gln


1 








5 








Phe 


Glu 


Trp 


Arg 


Trp 


Val 


Asp 


Ile








20 










Ala 


Pro 


Lys 


Gly 


Phe 


Gly 


Gly Val 






35 










40 


Val 
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Ile


Tyr 


Asn 


Pro 


Phe 


Arg 
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Val 


Ser 


Tyr 


Lys 


Leu 


Cys 


Thr 


Arg 


65 
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Asn 


Met 


Val 


Thr 


Arg 


Cys 


Asn 


Asn 










85 








Ala 


Val 


Ile


Asn 


His 


Met 


Cys 


Gly 



100 



Gly Arg Thr Ser Ile Val His Leu

10 15
Ala Leu Glu Cys Glu Arg Tyr Leu
25 30

Gln Val Ser Pro Pro Asn Glu Asn
45

Pro Trp Trp Glu Arg Tyr Gln Pro
60

Ser Gly Asn Glu Asp Glu Phe Arg

75 80
Val Gly Val Arg Ile Tyr Val Asp

90 95
Asn Ala Val Ser Ala Gly Thr Ser
105 110
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Ser 


Thr 


Cys 


Gly Ser 


Tyr 


Phe 


Asn 


Pro 


Gly Ser Arg Asp 


Phe 


Pro 


Ala 






115 










120 
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Val 


Pro 

130


Tyr 


Ser 


Gly 


Trp 


Asp 
135 


Phe 


Asn 


Asp 


Gly 


Lys 
140 


Cys 


Lys 


Thr 


Gly 


Ser 


Gly 


Asp 


Ile


Glu 


Asn


Tyr 


Asn 


Asp 


Ala 


Thr 


Gln


Val 


Arg 


Asp 


Cys 


145










150 










155 










160 


Arg 


Leu 


Thr 


Gly 


Leu 
165 


Leu 


Asp 


Leu 


Ala 


Leu 
170 


Glu 


Lys 


Asp 


Tyr 


Val 
175 


Arg 


Ser 


Lys 


Ile


Ala 
180 


Glu 


Tyr 


Met 


Asn 


His 
185 


Leu 


Ile


Asp 


Ile


Gly 
190 


Val 


Ala 


Gly 


Phe 


Arg 


Leu Asp 


Ala 


Ser 


Lys 


His 


Met 


Trp 


Pro 


Gly Asp 


Ile


Lys 
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200 










205 








Ala 


Ile


Leu 


Asp 


Lys 


Leu 


His 


Asn 


Leu 


Asn 


Ser 


Asn 


Trp 


Phe 


Pro 


Ala 




210 










215 










220 








Gly 


Ser 


Lys 


Pro 


Phe 


Ile


Tyr 


Gln


Glu 


Val 


Ile


Asp 


Leu 


Gly 


Gly 


Glu 


225 










230 










235 










240 


Pro 


Ile


Lys 


Ser 


Ser 


Asp 


Tyr 


Phe 


Gly Asn 


Gly Arg 


Val 


Thr 


Glu 


Phe 










245 










250 










255 




Lys 


Tyr 


Gly 


Ala 
260 


Lys 


Leu 


Gly 


Thr 


Val 
265 


Ile


Arg 


Lys 


Trp 


Asn 
270 


Gly 


Glu 


Lys 


Met 


Ser 


Tyr 


Leu 


Lys 


Asn 


Trp 


Gly Glu Gly 


Trp 


Gly 


Phe 


Val 


Pro 






275 










280 










285 








Ser 


Asp 
290


Arg 


Ala 


Leu 


Val 


Phe 
295 


Val 


Asp 


Asn 


His 


Asp 
300 


Asn 


Gln


Arg 


Gly 


His 


Gly 


Ala 


Gly 


Gly 


Ala 


Ser 


Ile


Leu 


Thr 


Phe 


Trp 


Asp 


Ala 


Arg 


Leu 


305










310 










315 










320 


Tyr 


Lys 


Met 


Ala 


Val 
325 


Gly 


Phe 


Met 


Leu 


Ala 
330 


His 


Pro 


Tyr 


Gly 


Phe 
335 


Thr 


Arg 


Val 


Met 


Ser 
340 


Ser 


Tyr 


Arg 


Trp 


Pro 
345 


Arg 


Gln


Phe 


Gln


Asn 
350 


Gly 


Asn 


Asp 


Val 


Asn 


Asp 


Trp 


Val 


Gly 


Pro 


Pro 


Asn 


Asn 


Asn 


Gly Val 


Ile


Lys 
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360 








365 






Glu 


Val 


Thr 


Ile


Asn 


Pro 


Asp 


Thr 


Thr 


Cys 


Gly Asn 


Asp 


Trp 


Val 


Cys 
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Arg 
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Arg 


Gln


Ile


Arg 


Asn 


Met 


Val 


Ile


Phe 


Arg 


Asn 


Val 
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Val 


Asp 


Gly 


Gln


Pro 
405 


Phe 


Thr 


Asn 


Trp 


Tyr 
410 


Asp 


Asn 


Gly 


Ser 


Asn 
415 


Gln


Val 


Ala 


Phe 


Gly Arg 


Gly Asn 


Arg 


Gly 


Phe 


Ile


Val 


Phe 


Asn 


Asn 


Asp 








420 










425 










430 






Asp 


Trp 


Ser 


Phe 


Ser 


Leu 


Thr 


Leu 


Gln


Thr 


Gly Leu 


Pro 


Ala 


Gly 


Thr 
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440 










445 






Tyr 


Cys 
450 


Asp 


Val 


Ile


Ser 


Gly 
455 


Asp 


Lys 


Ile


Asn 


Gly 
460 
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Cys 


Thr 


Gly 


Ile


Lys 


Ile


Tyr 


Val 


Ser 


Asp 


Asp 


Gly 


Lys 


Ala 
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Phe 


Ser 


Ile


Ser 


465 
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Ala 


Glu 


Asp 
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Phe 


Ile


Ala 


Ile
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Ala 


Glu 


Ser 


Lys 
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Leu 



<210> 33 
<211> 370 
<212> PRT 

<213> Trichoderma reesei 
<400> 33 

Gln Pro Gly Thr Ser Thr Pro Glu
1 5 

Lys Cys Thr Lys Ser Gly Gly Cys 
20 

Leu Asp Trp Asn Tyr Arg Trp Met 

35 40 
Thr Val Asn Gly Gly Val Asn Thr 



Val His Pro Lys Leu Thr Thr Tyr 

10 15 

Val Ala Gln Asp Thr Ser Val Val

25 30 

His Asp Ala Asn Tyr Asn Ser Cys 
45 

Thr Leu Cys Pro Asp Glu Ala Thr 
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Cys 
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Ser 
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Val 
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Ser 


Ser 
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Met 
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Tyr 


Met 


Pro 


Ser 










85 
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Ser 


Ser 


Gly 


Gly 


Tyr 


Ser 


Ser 


Val 


Ser 


Pro 


Arg 


Leu 


Tyr 


Leu 


Leu 


Asp 








100 










105 
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Ser 


Asp 


Gly 


Glu 


Tyr 


Val 


Met 


Leu 


Lys 


Leu 


Asn 


Gly Gln Glu


Leu 


Ser 






115 










120 










125 








Phe 


Asp 


Val 


Asp 


Leu 


Ser 


Ala 


Leu 


Pro 


Cys 


Gly Glu Asn 


Gly 


Ser 


Leu 




130 










135 










140 










Tyr 


Leu 


Ser 


Gln


Met 


Asp 


Glu 


Asn 


Gly Gly 


Ala 


Asn 


Gln


Tyr 


Asn 


Thr 


145 










150 
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Ala 


Gly 


Ala 


Asn 


Tyr 


Gly 


Ser 


Gly 


Tyr 


Cys 


Asp 


Ala 


Gln


Cys 


Pro 


Val 










165 










170 










175 




Gln


Thr 


Trp 


Arg 


Asn 


Gly 


Thr 


Leu 


Asn 


Thr 


Ser 


His 


Gln


Gly 


Phe 


Cys 








180 










185 










190 






Cys 


Asn


Glu 


Met 


Asp 


Ile


Leu 


Glu 


Gly Asn 


Ser 


Arg 


Ala 


Asn 


Ala 


Leu 
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200 










205 








Thr 


Pro 


His 


Ser 


Cys 


Thr 


Ala 


Thr 


Ala 


Cys 


Asp 


Ser 


Ala 


Gly 


Cys 


Gly 




210 










215 










220 










Phe 


Asn 


Pro 


Tyr 


Gly 


Ser 


Gly 


Tyr 


Lys 


Ser 


Tyr 


Tyr 


Gly 


Pro 


Gly Asp 


225 










230 










235 










240 


Thr 


Val 


Asp 


Thr 


Ser 


Lys 


Thr 


Phe 


Thr 


Ile


Ile


Thr 


Gln


Phe 


Asn 


Thr 










245 










250 










255 




Asp 


Asn 


Gly 


Ser 


Pro 


Ser 


Gly 


Asn 


Leu 


Val 


Ser 


Ile


Thr 


Arg 


Lys 


Tyr 








260 










265 










270 






Gln


Gln


Asn 


Gly 


Val 


Asp 


Ile


Pro 


Ser 


Ala 


Gln


Pro 


Gly 


Gly 


Asp 


Thr 






275 










280 










285 








Ile


Ser 


Ser 


Cys 


Pro 


Ser 


Ala 


Ser 


Ala 


Tyr 


Gly 


Gly 


Leu 


Ala 


Thr 


Met 




290 










295 










300 










Gly 


Lys 


Ala 


Leu 


Ser 


Ser 


Gly 


Met 


Val 


Leu 


Val 


Phe 


Ser 


Ile


Trp 


Asn 


305 










310 










315 










320 


Asp 


Asn 


Ser 


Gln


Tyr 


Met 


Asn 


Trp 


Leu 


Asp 


Ser 


Gly 


Asn 


Ala 


Gly 


Pro 










325 










330 










335 




Cys 


Ser 


Ser 


Thr 


Glu 


Gly 


Asn 


Pro 


Ser 


Asn 


Ile


Leu 


Ala 


Asn 


Asn 


Pro 








340 










345 










350 






Asn 


Thr 


His 


Val 


Val 


Phe 


Ser 


Asn 


Ile


Arg 


Trp 


Gly Asp 


Ile


Gly 


Ser 






355 










360 










365 









Thr Thr 



370 



<210> 34 

<211> 223 

<212> PRT 

<213> Aspergillus niger 

<400> 34 



Gln


Thr 


Met 


Cys 


Ser 


Gln


Tyr 


Asp 


Ser 


Ala 


Ser 


Ser 


Pro 


Pro 


Tyr 


Ser 


1 








5 










10 










15 




Val 


Asn 


Gln


Asn 


Leu 


Trp 


Gly 


Glu 


Tyr 


Gln


Gly 


Thr 


Gly 


Ser 


Gln


Cys 








20 










25 










30 






Val 


Tyr 


Val 


Asp 


Lys 


Leu 


Ser 


Ser 


Ser 


Gly 


Ala 


Ser 


Trp 


His 


Thr 


Glu 






35 










40 










45 








Trp 


Thr 


Trp 


Ser 


Gly 


Gly 


Glu 


Gly 


Thr 


Val 


Lys 


Ser 


Tyr 


Ser 


Asn 


Ser 




50 










55 










60 










Gly 


Val 


Thr 


Phe 


Asn 


Lys 


Lys 


Leu 


Val 


Ser 


Asp 


Val 


Ser 


Ser 


Ile


Pro 


65 










70 










75 










80 


Thr 


Ser 


Val 


Glu 


Trp 


Lys 


Gln


Asp 


Asn 


Thr 


Asn 


Val 


Asn 


Ala 


Asp 


Val 










85 










90 










95 




Ala 


Tyr 


Asp 


Leu 


Phe 


Thr 


Ala 


Ala 


Asn 


Val 


Asp 


His 


Ala 


Thr 


Ser 


Ser 



100 105 110 
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25 
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Gly Asp Tyr Glu Leu 
115 

Pro Ile Gly Lys Gln
130 

Glu Val Trp Tyr Gly 
145 

Tyr Ser Phe Val Ser 
165 

Asn Ala Phe Phe Ser 
180 

Ser Gln Tyr Leu Ile
195 

Gly Pro Ala Thr Phe 
210 



Met Ile Trp Leu Ala
120 

Ile Ala Thr Ala Thr
135 

Ser Thr Thr Gln Ala
150 

Glu Ser Pro Ile Asn
170 

Tyr Leu Thr Gln Asn
185 

Asn Leu Gln Phe Gly
200 

Thr Val Asp Asn Trp 
215 



Arg Tyr Gly Asn Ile Gln
125 

Val Gly Gly Lys Ser Trp 
140 

Gly Ala Glu Gln Arg Thr
155 160 
Ser Tyr Ser Gly Asp Ile
175 

Gln Gly Phe Pro Ala Ser
190 

Thr Glu Ala Phe Thr Gly 
205 

Thr Ala Ser Val Asn 
220 



<210> 35 

<211> 184 

<212> PRT 

<213> Aspergillus niger 



<400> 35 

Ser Ala Gly Ile

1 

Phe Thr Tyr Asp 
20 

Gly Val Ser Ser 
35 

Ser Asn Ala He 
50 

Ser Tyr Leu Ala 
65 

Tyr Ile Val Glu

Ser Leu Gly Thr 
100 

Asp Thr Arg Thr 
115 

Gln Tyr Phe Ser
130 

Val Ala Asn His 
145 

Asp Phe Asn Tyr 

Ser Ala Ser Val 
180



Asn 


Tyr 


Val 


Gln


5 








Glu 


Ser 


Ala 


Gly 


Asp 


Phe 


Val 


Val 








40 


Thr 


Tyr 


Ser 


Ala 






55 




Val 


Tyr 


Gly 


Trp 




70 






Asp 


Tyr 


Gly 


Asp 


85 








Val 


Tyr 


Ser 


Asp 


Asn 


Glu 


Pro 


Ser 








120 


Val 


Arg 


Glu 


Ser 






135 




Phe 


Asn 


Phe 


Trp 




150 






Gln


Val 


Val 


Ala 


165 








Thr 


Ile


Ser 


Ser 



Asn 


Tyr 


Asn 


Gly 




10 






Thr 


Phe 


Ser 


Met 


25 








Gly 


Leu 


Gly 


Trp 


Glu 


Tyr 


Ser 


Ala 








60 


Val 


Asn 


Tyr 


Pro 






75 




Tyr 


Asn 


Pro 


Cys 




90 






Gly 


Ser 


Thr 


Tyr 


105 








Ile


Thr 


Gly 


Thr 


Thr 


Arg 


Thr 


Ser 








140 


Ala 


His 


His 


Gly 






155 




Val 


Glu 


Ala 


Trp 




170 







Asn Leu Gly Asp 
15 

Tyr Trp Glu Asp 
30 

Thr Thr Gly Ser 
45 

Ser Gly Ser Ala 

Gln Ala Glu Tyr
80 

Ser Ser Ala Thr 
95 

Gln Val Cys Thr
110 

Ser Thr Phe Thr 
125 

Gly Thr Val Thr 

Phe Gly Asn Ser 
160 

Ser Gly Ala Gly 
175 



<210> 36 
<211> 313 
<212> PRT 

<213> Streptomyces lividans 
<400> 36 

Ala Glu Ser Thr Leu Gly Ala Ala 
1 5 

Gly Thr Ala Ile Ala Ser Gly Arg
20 

Ile Ala Gly Arg Glu Phe Asn Met

35 40 
Ile Asp Ala Thr Glu Pro Gln Arg

50 55 
Asp Arg Val Tyr Asn Trp Ala Val 



Ala Ala Gln Ser Gly Arg Tyr Phe

10 15 
Leu Ser Asp Ser Thr Tyr Thr Ser 
25 30 
Val Thr Ala Glu Asn Glu Met Lys 
45 

Gly Gln Phe Asn Phe Ser Ser Ala
60 

Gln Asn Gly Lys Gln Val Arg Gly
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65 










70 










75 










80 


His 


Thr 


Leu 


Ala 


Trp 


His 


Ser 


Gln


Gln


Pro 


Gly 


Trp 


Met 


Gln


Ser 


Leu 










85 










90 










95 




Ser 


Gly 


Ser 


Ala 


Leu 


Arg 


Gln


Ala 


Met 


Ile


Asp 


His 


Ile


Asn 


Gly 


Val 








100 










105 










110 






Met 


Ala 


His 


Tyr 


Lys 


Gly Lys 


Ile


Val 


Gln


Trp 


Asp 


Val 


Val 


Asn 


Glu 






115 










120 










125 








Ala 


Phe 


Ala 


Asp 


Gly Ser 


Ser 


Gly Ala 


Arg 


Arg 


Asp 


Ser 


Asn 


Leu 


Gln




130 










135 










140 










Arg 


Ser 


Gly Asn 


Asp 


Trp 


Ile


Glu 


Val 


Ala 


Phe 


Arg 


Thr 


Ala 


Arg 


Ala 


145 










150 










155 










160 


Ala 


Asp 


Pro 


Ser 


Ala 


Lys 


Leu 


Cys 


Tyr 


Asn 


Asp 


Tyr 


Asn 


Val 


Glu 


Asn 










165 










170 










175 




Trp 


Thr 


Trp 


Ala 


Lys 


Thr 


Gln


Ala 


Met 


Tyr 


Asn 


Met 


Val 


Arg 


Asp 


Phe 








180 










185 










190 






Lys 


Gln


Arg 


Gly 


Val 


Pro 


Ile


Asp 


Cys 


Val 


Gly 


Phe 


Gln


Ser 


His 


Phe 






195 










200 










205 








Asn 


Ser 


Gly 


Ser 


Pro 


Tyr 


Asn 


Ser 


Asn 


Phe 


Arg 


Thr 


Thr 


Leu 


Gln


Asn 




210 










215 










220 










Phe 


Ala 


Ala 


Leu 


Gly 


Val 


Asp 


Val 


Ala 


Ile


Thr 


Glu 


Leu 


Asp 


Ile


Gln


225 










230 










235 










240 


Gly Ala 


Pro 


Ala 


Ser 


Thr 


Tyr 


Ala 


Asn 


Val 


Thr 


Asn 


Asp 


Cys 


Leu 


Ala 










245 










250 










255 




Val 


Ser 


Arg 


Cys 


Leu 


Gly 


Ile


Thr 


Val 


Trp 


Gly Val 


Arg 


Asp 


Ser 


Asp 








260 










265 










270 






Ser 


Trp 


Arg 


Ser 


Glu 


Gln


Thr 


Pro 


Leu 


Leu 


Phe 


Asn 


Asn 


Asp 


Gly 


Ser 






275 










280 










285 








Lys 


Lys 


Ala 


Ala 


Tyr 


Thr 


Ala 


Val 


Leu 


Asp 


Ala 


Leu 


Asn 


Gly 


Gly 


Ala 




290 










295 










300 










Ser 


Ser 


Glu 


Pro 


Pro 


Ala 


Asp 


Gly Gly 
















305 










310 






















<210> 37

<211> 362

<212> PRT

<213> Aspergillus niger



<400> 37



Met 


His 


Ser 


Phe 


Ala 


Ser 


Leu 


Leu 


Ala 


Tyr 


Gly 


Leu 


Val 


Ala 


Gly 


Ala 


1 








5 










10 










15 




Thr 


Phe 


Ala 


Ser 


Ala 


Ser 


Pro 


He 


Glu 


Ala 


Arg 


Asp 


Ser 


Cys 


Thr 


Phe 








20 










25 










30 






Thr 


Thr 


Ala 


Ala 


Ala 


Ala 


Lys 


Ala 


Gly 


Lys 


Ala 


Lys 


Cys 


Ser 


Thr 


He 






35 










40 










45 








Thr 


Leu 


Asn 


Asn 


He 


Glu 


Val 


Pro 


Ala 


Gly 


Thr 


Thr 


Leu 


Asp 


Leu 


Thr 




50 










55 










60 










Gly 


Leu 


Thr 


Ser 


Gly 


Thr 


Lys 


Val 


He 


Phe 


Glu 


Gly 


Thr 


Thr 


Thr 


Phe 


65 










70 










75 










80 


Gin 


Tyr 


Glu 


Glu 


Trp 


Ala 


Gly 


Pro 


Leu 


He 


Ser 


Met 


Ser 


Gly 


Glu 


His 










85 










90 










95 




He 


Thr 


Val 


Thr 


Gly 


Ala 


Ser 


Gly His 


Leu 


He 


Asn 


Cys 


Asp 


Gly 


Ala 








100 










105 










110 






Arg 


Trp 


Trp 


Asp 


Gly 


Lys 


Gly 


Thr 


Ser 


Gly 


Lys 


Lys 


Lys 


Pro 


Lys 


Phe 






115 










120 










125 








Phe 


Tyr 


Ala 


His 


Gly 


Leu 


Asp 


Ser 


Ser 


Ser 


He 


Thr 


Gly 


Leu 


Asn 


He 




130 










135 










140 










Lys 


Asn 


Thr 


Pro 


Leu 


Met 


Ala 


Phe 


Ser 


Val 


Gin 


Ala 


Asn 


Asp 


He 


Thr 


145 










150 










155 










160 


Phe 


Thr 


Asp 


Val 


Thr 


He 


Asn 


Asn 


Ala 


Asp 


Gly Asp 


Thr 


Gin 


Gly 


Gly 










165 










170 










175 




His 


Asn 


Thr 


Asp 


Ala 


Phe 


Asp 


Val 


Gly Asn 


Ser 


Val 


Gly 


Val 


Asn 


He 








180 










185 










190 
Ile Lys Pro Trp Val His Asn Gln Asp Asp Cys Leu Ala Val Asn Ser
        195                 200                 205

Gly Glu Asn Ile Trp Phe Thr Gly Gly Thr Cys Ile Gly Gly His Gly
    210                 215                 220

Leu Ser Ile Gly Ser Val Gly Asp Arg Ser Asn Asn Val Val Lys Asn
225                 230                 235                 240

Val Thr Ile Glu His Ser Thr Val Ser Asn Ser Glu Asn Ala Val Arg
                245                 250                 255

Ile Lys Thr Ile Ser Gly Ala Thr Gly Ser Val Ser Glu Ile Thr Tyr
            260                 265                 270

Ser Asn Ile Val Met Ser Gly Ile Ser Asp Tyr Gly Val Val Ile Gln
        275                 280                 285

Gln Asp Tyr Glu Asp Gly Lys Pro Thr Gly Lys Pro Thr Asn Gly Val
    290                 295                 300

Thr Ile Gln Asp Val Lys Leu Glu Ser Val Thr Gly Ser Val Asp Ser
305                 310                 315                 320

Gly Ala Thr Glu Ile Tyr Leu Leu Cys Gly Ser Gly Ser Cys Ser Asp
                325                 330                 335

Trp Thr Trp Asp Asp Val Lys Val Thr Gly Gly Lys Lys Ser Thr Ala
            340                 345                 350

Cys Lys Asn Phe Pro Ser Val Ala Ser Cys
        355                 360



<210> 38

<211> 383

<212> PRT

<213> Pseudomonas cellulosa



<400> 38

Arg Ala Asp Val Lys Pro Val Thr Val Lys Leu Val Asp Ser Gln Ala
1               5                   10                  15

Thr Met Glu Thr Arg Ser Leu Phe Ala Phe Met Gln Glu Gln Arg Arg
            20                  25                  30

His Ser Ile Met Phe Gly His Gln His Glu Thr Thr Gln Gly Leu Thr
        35                  40                  45

Ile Thr Arg Thr Asp Gly Thr Gln Ser Asp Thr Phe Asn Ala Val Gly
    50                  55                  60

Asp Phe Ala Ala Val Tyr Gly Trp Asp Thr Leu Ser Ile Val Ala Pro
65                  70                  75                  80

Lys Ala Glu Gly Asp Ile Val Ala Gln Val Lys Lys Ala Tyr Ala Arg
                85                  90                  95

Gly Gly Ile Ile Thr Val Ser Ser His Phe Asp Asn Pro Lys Thr Asp
            100                 105                 110

Thr Gln Lys Gly Val Trp Pro Val Gly Thr Ser Trp Asp Gln Thr Pro
        115                 120                 125

Ala Val Val Asp Ser Leu Pro Gly Gly Ala Tyr Asn Pro Val Leu Asn
    130                 135                 140

Gly Tyr Leu Asp Gln Val Ala Glu Trp Ala Asn Asn Leu Lys Asp Glu
145                 150                 155                 160

Gln Gly Arg Leu Ile Pro Val Ile Phe Arg Leu Tyr His Ala Asn Thr
                165                 170                 175

Gly Ser Trp Phe Trp Trp Gly Asp Lys Gln Ser Thr Pro Glu Gln Tyr
            180                 185                 190

Lys Gln Leu Phe Arg Tyr Ser Val Glu Tyr Leu Arg Asp Val Lys Gly
        195                 200                 205

Val Arg Asn Phe Leu Tyr Ala Tyr Ser Pro Asn Asn Phe Trp Asp Val
    210                 215                 220

Thr Glu Ala Asn Tyr Leu Glu Arg Tyr Pro Gly Asp Glu Trp Val Asp
225                 230                 235                 240

Val Leu Gly Phe Asp Thr Tyr Gly Pro Val Ala Asp Asn Ala Asp Trp
                245                 250                 255

Phe Arg Asn Val Val Ala Asn Ala Ala Leu Val Ala Arg Met Ala Glu
            260                 265                 270






Ala Arg Gly Lys Ile Pro Val Ile Ser Glu Ile Gly Ile Arg Ala Pro
        275                 280                 285

Asp Ile Glu Ala Gly Leu Tyr Asp Asn Gln Trp Tyr Arg Lys Leu Ile
    290                 295                 300

Ser Gly Leu Lys Ala Asp Pro Asp Ala Arg Glu Ile Ala Phe Leu Leu
305                 310                 315                 320

Val Trp Arg Asn Ala Pro Gln Gly Val Pro Gly Pro Asn Gly Thr Gln
                325                 330                 335

Val Pro His Tyr Trp Val Pro Ala Asn Arg Pro Glu Asn Ile Asn Asn
            340                 345                 350

Gly Thr Leu Glu Asp Phe Gln Ala Phe Tyr Ala Asp Glu Phe Thr Ala
        355                 360                 365

Phe Asn Arg Asp Ile Glu Gln Val Tyr Gln Arg Pro Thr Leu Ile
    370                 375                 380





<210> 39 

<211> 419 

<212> PRT 

<213> Bacillus circulans 



<400> 39 



Leu Gln Pro Ala Thr Ala Glu Ala Ala Asp Ser Tyr Lys Ile Val Gly
1               5                   10                  15

Tyr Tyr Pro Ser Trp Ala Ala Tyr Gly Arg Asn Tyr Asn Val Ala Asp
            20                  25                  30

Ile Asp Pro Thr Lys Val Thr His Ile Asn Tyr Ala Phe Ala Asp Ile
        35                  40                  45

Cys Trp Asn Gly Ile His Gly Asn Pro Asp Pro Ser Gly Pro Asn Pro
    50                  55                  60

Val Thr Trp Thr Cys Gln Asn Glu Lys Ser Gln Thr Ile Asn Val Pro
65                  70                  75                  80

Asn Gly Thr Ile Val Leu Gly Asp Pro Trp Ile Asp Thr Gly Lys Thr
                85                  90                  95

Phe Ala Gly Asp Thr Trp Asp Gln Pro Ile Ala Gly Asn Ile Asn Gln
            100                 105                 110

Leu Asn Lys Leu Lys Gln Thr Asn Pro Asn Leu Lys Thr Ile Ile Ser
        115                 120                 125

Val Gly Gly Trp Thr Trp Ser Asn Arg Phe Ser Asp Val Ala Ala Thr
    130                 135                 140

Ala Ala Thr Arg Glu Val Phe Ala Asn Ser Ala Val Asp Phe Leu Arg
145                 150                 155                 160

Lys Tyr Asn Phe Asp Gly Val Asp Leu Asp Trp Glu Tyr Pro Val Ser
                165                 170                 175

Gly Gly Leu Asp Gly Asn Ser Lys Arg Pro Glu Asp Lys Gln Asn Tyr
            180                 185                 190

Thr Leu Leu Leu Ser Lys Ile Arg Glu Lys Leu Asp Ala Ala Gly Ala
        195                 200                 205

Val Asp Gly Lys Lys Tyr Leu Leu Thr Ile Ala Ser Gly Ala Ser Ala
    210                 215                 220

Thr Tyr Ala Ala Asn Thr Glu Leu Ala Lys Ile Ala Ala Ile Val Asp
225                 230                 235                 240

Trp Ile Asn Ile Met Thr Tyr Asp Phe Asn Gly Ala Trp Gln Lys Ile
                245                 250                 255

Ser Ala His Asn Ala Pro Leu Asn Tyr Asp Pro Ala Ala Ser Ala Ala
            260                 265                 270

Gly Val Pro Asp Ala Asn Thr Phe Asn Val Ala Ala Gly Ala Gln Gly
        275                 280                 285

His Leu Asp Ala Gly Val Pro Ala Ala Lys Leu Val Leu Gly Val Pro
    290                 295                 300

Phe Tyr Gly Arg Gly Trp Asp Gly Cys Ala Gln Ala Gly Asn Gly Gln
305                 310                 315                 320

Tyr Gln Thr Cys Thr Gly Gly Ser Ser Val Gly Thr Trp Glu Ala Gly
                325                 330                 335

Ser Phe Asp Phe Tyr Asp Leu Glu Ala Asn Tyr Ile Asn Lys Asn Gly
            340                 345                 350

Tyr Thr Arg Tyr Trp Asn Asp Thr Ala Lys Val Pro Tyr Leu Tyr Asn
        355                 360                 365

Ala Ser Asn Lys Arg Phe Ile Ser Tyr Asp Asp Ala Glu Ser Val Gly
    370                 375                 380

Tyr Lys Thr Ala Tyr Ile Lys Ser Lys Gly Leu Gly Gly Ala Met Phe
385                 390                 395                 400

Trp Glu Leu Ser Gly Asp Arg Asn Lys Thr Leu Gln Asn Lys Leu Lys
                405                 410                 415

Ala Asp Leu


<210> 40

<211> 317

<212> PRT

<213> Candida antarctica



<400> 40

Leu Pro Ser Gly Ser Asp Pro Ala Phe Ser Gln Pro Lys Ser Val Leu
1               5                   10                  15

Asp Ala Gly Leu Thr Cys Gln Gly Ala Ser Pro Ser Ser Val Ser Lys
            20                  25                  30

Pro Ile Leu Leu Val Pro Gly Thr Gly Thr Thr Gly Pro Gln Ser Phe
        35                  40                  45

Asp Ser Asn Trp Ile Pro Leu Ser Thr Gln Leu Gly Tyr Thr Pro Cys
    50                  55                  60

Trp Ile Ser Pro Pro Pro Phe Met Leu Asn Asp Thr Gln Val Asn Thr
65                  70                  75                  80

Glu Tyr Met Val Asn Ala Ile Thr Ala Leu Tyr Ala Gly Ser Gly Asn
                85                  90                  95

Asn Lys Leu Pro Val Leu Thr Trp Ser Gln Gly Gly Leu Val Ala Gln
            100                 105                 110

Trp Gly Leu Thr Phe Phe Pro Ser Ile Arg Ser Lys Val Asp Arg Leu
        115                 120                 125

Met Ala Phe Ala Pro Asp Tyr Lys Gly Thr Val Leu Ala Gly Pro Leu
    130                 135                 140

Asp Ala Leu Ala Val Ser Ala Pro Ser Val Trp Gln Gln Thr Thr Gly
145                 150                 155                 160

Ser Ala Leu Thr Thr Ala Leu Arg Asn Ala Gly Gly Leu Thr Gln Ile
                165                 170                 175

Val Pro Thr Thr Asn Leu Tyr Ser Ala Thr Asp Glu Ile Val Gln Pro
            180                 185                 190

Gln Val Ser Asn Ser Pro Leu Asp Ser Ser Tyr Leu Phe Asn Gly Lys
        195                 200                 205

Asn Val Gln Ala Gln Ala Val Cys Gly Pro Leu Phe Val Ile Asp His
    210                 215                 220

Ala Gly Ser Leu Thr Ser Gln Phe Ser Tyr Val Val Gly Arg Ser Ala
225                 230                 235                 240

Leu Arg Ser Thr Thr Gly Gln Ala Arg Ser Ala Asp Tyr Gly Ile Thr
                245                 250                 255

Asp Cys Asn Pro Leu Pro Ala Asn Asp Leu Thr Pro Glu Gln Lys Val
            260                 265                 270

Ala Ala Ala Ala Leu Leu Ala Pro Ala Ala Ala Ala Ile Val Ala Gly
        275                 280                 285

Pro Lys Gln Asn Cys Glu Pro Asp Leu Met Pro Tyr Ala Arg Pro Phe
    290                 295                 300

Ala Val Gly Lys Arg Thr Cys Ser Gly Ile Val Thr Pro
305                 310                 315

<210> 41

<211> 434

<212> PRT

<213> artificial sequence

<220>

<223> chimera of guinea pig and homo sapiens (human = approx. last 30 amino acids)



<400> 41

Ala Glu Val Cys Tyr Ser His Leu Gly Cys Phe Ser Asp Glu Lys Pro
1               5                   10                  15

Trp Ala Gly Thr Ser Gln Arg Pro Ile Lys Ser Leu Pro Ser Asp Pro
            20                  25                  30

Lys Lys Ile Asn Thr Arg Phe Leu Leu Tyr Thr Asn Glu Asn Gln Asn
        35                  40                  45

Ser Tyr Gln Leu Ile Thr Ala Thr Asp Ile Ala Thr Ile Lys Ala Ser
    50                  55                  60

Asn Phe Asn Leu Asn Arg Lys Thr Arg Phe Ile Ile His Gly Phe Thr
65                  70                  75                  80

Asp Ser Gly Glu Asn Ser Trp Leu Ser Asp Met Cys Lys Asn Met Phe
                85                  90                  95

Gln Val Glu Lys Val Asn Cys Ile Cys Val Asp Trp Lys Gly Gly Ser
            100                 105                 110

Lys Ala Gln Tyr Ser Gln Ala Ser Gln Asn Ile Arg Val Val Gly Ala
        115                 120                 125

Glu Val Ala Tyr Leu Val Gln Val Leu Ser Thr Ser Leu Asn Tyr Ala
    130                 135                 140

Pro Glu Asn Val His Ile Ile Gly His Ser Leu Gly Ala His Thr Ala
145                 150                 155                 160

Gly Glu Ala Gly Lys Arg Leu Asn Gly Leu Val Gly Arg Ile Thr Gly
                165                 170                 175

Leu Asp Pro Ala Glu Pro Tyr Phe Gln Asp Thr Pro Glu Glu Val Arg
            180                 185                 190

Leu Asp Pro Ser Asp Ala Lys Phe Val Asp Val Ile His Thr Asp Ile
        195                 200                 205

Ser Pro Ile Leu Pro Ser Leu Gly Phe Gly Met Ser Gln Lys Val Gly
    210                 215                 220

His Met Asp Phe Phe Pro Asn Gly Gly Lys Asp Met Pro Gly Cys Lys
225                 230                 235                 240

Thr Gly Ile Ser Cys Asn His His Arg Ser Ile Glu Tyr Tyr His Ser
                245                 250                 255

Ser Ile Leu Asn Pro Glu Gly Phe Leu Gly Tyr Pro Cys Ala Ser Tyr
            260                 265                 270

Asp Glu Phe Gln Glu Ser Gly Cys Phe Pro Cys Pro Ala Lys Gly Cys
        275                 280                 285

Pro Lys Met Gly His Phe Ala Asp Gln Tyr Pro Gly Lys Thr Asn Ala
    290                 295                 300

Val Glu Gln Thr Phe Phe Leu Asn Thr Gly Ala Ser Asp Asn Phe Thr
305                 310                 315                 320

Arg Trp Arg Tyr Lys Val Thr Val Thr Leu Ser Gly Glu Lys Asp Pro
                325                 330                 335

Ser Gly Asn Ile Asn Val Ala Leu Leu Gly Lys Asn Gly Asn Ser Ala
            340                 345                 350

Gln Tyr Gln Val Phe Lys Gly Thr Leu Lys Pro Asp Ala Ser Tyr Thr
        355                 360                 365

Asn Ser Ile Asp Val Glu Leu Asn Val Gly Thr Ile Gln Lys Val Thr
    370                 375                 380

Phe Leu Trp Lys Arg Ser Gly Ile Ser Val Ser Lys Pro Lys Met Gly
385                 390                 395                 400

Ala Ser Arg Ile Thr Val Gln Ser Gly Lys Asp Gly Thr Lys Tyr Asn
                405                 410                 415

Phe Cys Ser Ser Asp Ile Val Gln Glu Asn Val Glu Gln Thr Leu Ser
            420                 425                 430

Pro Cys



<210> 42 

<211> 471 

<212> PRT 

<213> Escherichia coli 



<400> 42 



Met Lys Gln Ser Thr Ile Ala Leu Ala Leu Leu Pro Leu Leu Phe Thr
1               5                   10                  15

Pro Val Thr Lys Ala Arg Thr Pro Glu Met Pro Val Leu Glu Asn Arg
            20                  25                  30

Ala Ala Gln Gly Asp Ile Thr Ala Pro Gly Gly Ala Arg Arg Leu Thr
        35                  40                  45

Gly Asp Gln Thr Ala Ala Leu Arg Asp Ser Leu Ser Asp Lys Pro Ala
    50                  55                  60

Lys Asn Ile Ile Leu Leu Ile Gly Asp Gly Met Gly Asp Ser Glu Ile
65                  70                  75                  80

Thr Ala Ala Arg Asn Tyr Ala Glu Gly Ala Gly Gly Phe Phe Lys Gly
                85                  90                  95

Ile Asp Ala Leu Pro Leu Thr Gly Gln Tyr Thr His Tyr Ala Leu Asn
            100                 105                 110

Lys Lys Thr Gly Lys Pro Asp Tyr Val Thr Asp Ser Ala Ala Ser Ala
        115                 120                 125

Thr Ala Trp Ser Thr Gly Val Lys Thr Tyr Asn Gly Ala Leu Gly Val
    130                 135                 140

Asp Ile His Glu Lys Asp His Pro Thr Ile Leu Glu Met Ala Lys Ala
145                 150                 155                 160

Ala Gly Leu Ala Thr Gly Asn Val Ser Thr Ala Glu Leu Gln Asp Ala
                165                 170                 175

Thr Pro Ala Ala Leu Val Ala His Val Thr Ser Arg Lys Cys Tyr Gly
            180                 185                 190

Pro Ser Ala Thr Ser Glu Lys Cys Pro Gly Asn Ala Leu Glu Lys Gly
        195                 200                 205

Gly Lys Gly Ser Ile Thr Glu Gln Leu Leu Asn Ala Arg Ala Asp Val
    210                 215                 220

Thr Leu Gly Gly Gly Ala Lys Thr Phe Ala Glu Thr Ala Thr Ala Gly
225                 230                 235                 240

Glu Trp Gln Gly Lys Thr Leu Arg Glu Gln Ala Gln Ala Arg Gly Tyr
                245                 250                 255

Gln Leu Val Ser Asp Ala Ala Ser Leu Asn Ser Val Thr Glu Ala Asn
            260                 265                 270

Gln Gln Lys Pro Leu Leu Gly Leu Phe Ala Asp Gly Asn Met Pro Val
        275                 280                 285

Arg Trp Leu Gly Pro Lys Ala Thr Tyr His Gly Asn Ile Asp Lys Pro
    290                 295                 300

Ala Val Thr Cys Thr Pro Asn Pro Gln Arg Asn Asp Ser Val Pro Thr
305                 310                 315                 320

Leu Ala Gln Met Thr Asp Lys Ala Ile Glu Leu Leu Ser Lys Asn Glu
                325                 330                 335

Lys Gly Phe Phe Leu Gln Val Glu Gly Ala Ser Ile Asp Lys Gln Asp
            340                 345                 350

His Ala Ala Asn Pro Cys Gly Gln Ile Gly Glu Thr Val Asp Leu Asp
        355                 360                 365

Glu Ala Val Gln Arg Ala Leu Glu Phe Ala Lys Lys Glu Gly Asn Thr
    370                 375                 380

Leu Val Ile Val Thr Ala Asp His Ala His Ala Ser Gln Ile Val Ala
385                 390                 395                 400

Pro Asp Thr Lys Ala Pro Gly Leu Thr Gln Ala Leu Asn Thr Lys Asp
                405                 410                 415

Gly Ala Val Met Val Met Ser Tyr Gly Asn Ser Glu Glu Asp Ser Gln
            420                 425                 430

Glu His Thr Gly Ser Gln Leu Arg Ile Ala Ala Tyr Gly Pro His Ala
        435                 440                 445

Ala Asn Val Val Gly Leu Thr Asp Gln Thr Asp Leu Phe Tyr Thr Met
    450                 455                 460

Lys Ala Ala Leu Gly Leu Lys
465                 470



<210> 43 

<211> 260 

<212> PRT 

<213> Bovine 



<400> 43 



Leu Lys Ile Ala Ala Phe Asn Ile Arg Thr Phe Gly Glu Thr Lys Met
1               5                   10                  15

Ser Asn Ala Thr Leu Ala Ser Tyr Ile Val Arg Ile Val Arg Arg Tyr
            20                  25                  30

Asp Ile Val Leu Ile Gln Glu Val Arg Asp Ser His Leu Val Ala Val
        35                  40                  45

Gly Lys Leu Leu Asp Tyr Leu Asn Gln Asp Asp Pro Asn Thr Tyr His
    50                  55                  60

Tyr Val Val Ser Glu Pro Leu Gly Arg Asn Ser Tyr Lys Glu Arg Tyr
65                  70                  75                  80

Leu Phe Leu Phe Arg Pro Asn Lys Val Ser Val Leu Asp Thr Tyr Gln
                85                  90                  95

Tyr Asp Asp Gly Cys Glu Ser Cys Gly Asn Asp Ser Phe Ser Arg Glu
            100                 105                 110

Pro Ala Val Val Lys Phe Ser Ser His Ser Thr Lys Val Lys Glu Phe
        115                 120                 125

Ala Ile Val Ala Leu His Ser Ala Pro Ser Asp Ala Val Ala Glu Ile
    130                 135                 140

Asn Ser Leu Tyr Asp Val Tyr Leu Asp Val Gln Gln Lys Trp His Leu
145                 150                 155                 160

Asn Asp Val Met Leu Met Gly Asp Phe Asn Ala Asp Cys Ser Tyr Val
                165                 170                 175

Thr Ser Ser Gln Trp Ser Ser Ile Arg Leu Arg Thr Ser Ser Thr Phe
            180                 185                 190

Gln Trp Leu Ile Pro Asp Ser Ala Asp Thr Thr Ala Thr Ser Thr Asn
        195                 200                 205

Cys Ala Tyr Asp Arg Ile Val Val Ala Gly Ser Leu Leu Gln Ser Ser
    210                 215                 220

Val Val Pro Gly Ser Ala Ala Pro Phe Asp Phe Gln Ala Ala Tyr Gly
225                 230                 235                 240

Leu Ser Asn Glu Met Ala Leu Ala Ile Ser Asp His Tyr Pro Val Glu
                245                 250                 255

Val Thr Leu Thr


<210> 44 
<211> 686 
<212> PRT 

<213> Bacillus circulans 
<400> 44 

Ala Pro Asp Thr Ser Val Ser Asn Lys Gln Asn Phe Ser Thr Asp Val
1               5                   10                  15

Ile Tyr Gln Ile Phe Thr Asp Arg Phe Ser Asp Gly Asn Pro Ala Asn
            20                  25                  30

Asn Pro Thr Gly Ala Ala Phe Asp Gly Thr Cys Thr Asn Leu Arg Leu
        35                  40                  45

Tyr Cys Gly Gly Asp Trp Gln Gly Ile Ile Asn Lys Ile Asn Asp Gly
    50                  55                  60

Tyr Leu Thr Gly Met Gly Val Thr Ala Ile Trp Ile Ser Gln Pro Val
65                  70                  75                  80

Glu Asn Ile Tyr Ser Ile Ile Asn Tyr Ser Gly Val Asn Asn Thr Ala
                85                  90                  95

Tyr His Gly Tyr Trp Ala Arg Asp Phe Lys Lys Thr Asn Pro Ala Tyr
            100                 105                 110

Gly Thr Ile Ala Asp Phe Gln Asn Leu Ile Ala Ala Ala His Ala Lys
        115                 120                 125

Asn Ile Lys Val Ile Ile Asp Phe Ala Pro Asn His Thr Ser Pro Ala
    130                 135                 140

Ser Ser Asp Gln Pro Ser Phe Ala Glu Asn Gly Arg Leu Tyr Asp Asn
145                 150                 155                 160

Gly Thr Leu Leu Gly Gly Tyr Thr Asn Asp Thr Gln Asn Leu Phe His
                165                 170                 175

His Asn Gly Gly Thr Asp Phe Ser Thr Thr Glu Asn Gly Ile Tyr Lys
            180                 185                 190

Asn Leu Tyr Asp Leu Ala Asp Leu Asn His Asn Asn Ser Thr Val Asp
        195                 200                 205

Val Tyr Leu Lys Asp Ala Ile Lys Met Trp Leu Asp Leu Gly Ile Asp
    210                 215                 220

Gly Ile Arg Met Asp Ala Val Lys His Met Pro Phe Gly Trp Gln Lys
225                 230                 235                 240

Ser Phe Met Ala Ala Val Asn Asn Tyr Lys Pro Val Phe Thr Phe Gly
                245                 250                 255

Glu Trp Phe Leu Gly Val Asn Glu Val Ser Pro Glu Asn His Lys Phe
            260                 265                 270

Ala Asn Glu Ser Gly Met Ser Leu Leu Asp Phe Arg Phe Ala Gln Lys
        275                 280                 285

Val Arg Gln Val Phe Arg Asp Asn Thr Asp Asn Met Tyr Gly Leu Lys
    290                 295                 300

Ala Met Leu Glu Gly Ser Ala Ala Asp Tyr Ala Gln Val Asp Asp Gln
305                 310                 315                 320

Val Thr Phe Ile Asp Asn His Asp Met Glu Arg Phe His Ala Ser Asn
                325                 330                 335

Ala Asn Arg Arg Lys Leu Glu Gln Ala Leu Ala Phe Thr Leu Thr Ser
            340                 345                 350

Arg Gly Val Pro Ala Ile Tyr Tyr Gly Thr Glu Gln Tyr Met Ser Gly
        355                 360                 365

Gly Thr Asp Pro Asp Asn Arg Ala Arg Ile Pro Ser Phe Ser Thr Ser
    370                 375                 380

Thr Thr Ala Tyr Gln Val Ile Gln Lys Leu Ala Pro Leu Arg Lys Cys
385                 390                 395                 400

Asn Pro Ala Ile Ala Tyr Gly Ser Thr Gln Glu Arg Trp Ile Asn Asn
                405                 410                 415

Asp Val Leu Ile Tyr Glu Arg Lys Phe Gly Ser Asn Val Ala Val Val
            420                 425                 430

Ala Val Asn Arg Asn Leu Asn Ala Pro Ala Ser Ile Ser Gly Leu Val
        435                 440                 445

Thr Ser Leu Pro Gln Gly Ser Tyr Asn Asp Val Leu Gly Gly Leu Leu
    450                 455                 460

Asn Gly Asn Thr Leu Ser Val Gly Ser Gly Gly Ala Ala Ser Asn Phe
465                 470                 475                 480

Thr Leu Ala Ala Gly Gly Thr Ala Val Trp Gln Tyr Thr Ala Ala Thr
                485                 490                 495

Ala Thr Pro Thr Ile Gly His Val Gly Pro Met Met Ala Lys Pro Gly
            500                 505                 510

Val Thr Ile Thr Ile Asp Gly Arg Gly Phe Gly Ser Ser Lys Gly Thr
        515                 520                 525

Val Tyr Phe Gly Thr Thr Ala Val Ser Gly Ala Asp Ile Thr Ser Trp
    530                 535                 540

Glu Asp Thr Gln Ile Lys Val Lys Ile Pro Ala Val Ala Gly Gly Asn
545                 550                 555                 560

Tyr Asn Ile Lys Val Ala Asn Ala Ala Gly Thr Ala Ser Asn Val Tyr
                565                 570                 575

Asp Asn Phe Glu Val Leu Ser Gly Asp Gln Val Ser Val Arg Phe Val
            580                 585                 590

Val Asn Asn Ala Thr Thr Ala Leu Gly Gln Asn Val Tyr Leu Thr Gly
        595                 600                 605

Ser Val Ser Glu Leu Gly Asn Trp Asp Pro Ala Lys Ala Ile Gly Pro
    610                 615                 620

Met Tyr Asn Gln Val Val Tyr Gln Tyr Pro Asn Trp Tyr Tyr Asp Val
625                 630                 635                 640

Ser Val Pro Ala Gly Lys Thr Ile Glu Phe Lys Phe Leu Lys Lys Gln
                645                 650                 655

Gly Ser Thr Val Thr Trp Glu Gly Gly Ser Asn His Thr Phe Thr Ala
            660                 665                 670

Pro Ser Ser Gly Thr Ala Thr Ile Asn Val Asn Trp Gln Pro
        675                 680                 685



<210> 45 

<211> 404 

<212> PRT 

<213> Amycolatopsis orientalis 



<400> 45 

Met Arg Val Leu Ile Thr Gly Cys Gly Ser Arg Gly Asp Thr Glu Pro
1 5 10 15

Leu Val Ala Leu Ala Ala Arg Leu Arg Glu Leu Gly Ala Asp Ala Arg
20 25 30

Met Cys Leu Pro Pro Asp Tyr Val Glu Arg Cys Ala Glu Val Gly Val
35 40 45

Pro Met Val Pro Val Gly Arg Ala Val Arg Ala Gly Ala Arg Glu Pro
50 55 60

Gly Glu Leu Pro Pro Gly Ala Ala Glu Val Val Thr Glu Val Val Ala
65 70 75 80

Glu Trp Phe Asp Lys Val Pro Ala Ala Ile Glu Gly Cys Asp Ala Val
85 90 95

Val Thr Thr Gly Leu Leu Pro Ala Ala Val Ala Val Arg Ser Met Ala
100 105 110

Glu Lys Leu Gly Ile Pro Tyr Arg Tyr Thr Val Leu Ser Pro Asp His
115 120 125

Leu Pro Ser Glu Gln Ser Gln Ala Glu Arg Asp Met Tyr Asn Gln Gly
130 135 140

Ala Asp Arg Leu Phe Gly Asp Ala Val Asn Ser His Arg Ala Ser Ile
145 150 155 160

Gly Leu Pro Pro Val Glu His Leu Tyr Asp Tyr Gly Tyr Thr Asp Gln
165 170 175

Pro Trp Leu Ala Ala Asp Pro Val Leu Ser Pro Leu Arg Pro Thr Asp
180 185 190

Leu Gly Thr Val Gln Thr Gly Ala Trp Ile Leu Pro Asp Glu Arg Pro
195 200 205

Leu Ser Ala Glu Leu Glu Ala Phe Leu Ala Ala Gly Ser Thr Pro Val
210 215 220

Tyr Val Gly Phe Gly Ser Ser Ser Arg Pro Ala Thr Ala Asp Ala Ala
225 230 235 240

Lys Met Ala Ile Lys Ala Val Arg Ala Ser Gly Arg Arg Ile Val Leu
245 250 255

Ser Arg Gly Trp Ala Asp Leu Val Leu Pro Asp Asp Gly Ala Asp Cys
260 265 270

Phe Val Val Gly Glu Val Asn Leu Gln Glu Leu Phe Gly Arg Val Ala
275 280 285

Ala Ala Ile His His Asp Ser Ala Gly Thr Thr Leu Leu Ala Met Arg
290 295 300

Ala Gly Ile Pro Gln Ile Val Val Arg Arg Val Val Asp Asn Val Val
305 310 315 320

Glu Gln Ala Tyr His Ala Asp Arg Val Ala Glu Leu Gly Val Gly Val
325 330 335

Ala Val Asp Gly Pro Val Pro Thr Ile Asp Ser Leu Ser Ala Ala Leu
340 345 350

Asp Thr Ala Leu Ala Pro Glu Ile Arg Ala Arg Ala Thr Thr Val Ala
355 360 365

Asp Thr Ile Arg Ala Asp Gly Thr Thr Val Ala Ala Gln Leu Leu Phe
370 375 380

Asp Ala Val Ser Leu Glu Lys Pro Thr Val Pro Ala Leu Glu His His
385 390 395 400

His His His His



<210> 46 

<211> 292 

<212> PRT 

<213> Pseudomonas sp.



<400> 46 

Ser Ile Glu Arg Leu Gly Tyr Leu Gly Phe Ala Val Lys Asp Val Pro
1 5 10 15

Ala Trp Asp His Phe Leu Thr Lys Ser Val Gly Leu Met Ala Ala Gly
20 25 30

Ser Ala Gly Asp Ala Ala Leu Tyr Arg Ala Asp Gln Arg Ala Trp Arg
35 40 45

Ile Ala Val Gln Pro Gly Glu Leu Asp Asp Leu Ala Tyr Ala Gly Leu
50 55 60

Glu Val Asp Asp Ala Ala Ala Leu Glu Arg Met Ala Asp Lys Leu Arg
65 70 75 80

Gln Ala Gly Val Ala Phe Thr Arg Gly Asp Glu Ala Leu Met Gln Gln
85 90 95

Arg Lys Val Met Gly Leu Leu Cys Leu Gln Asp Pro Phe Gly Leu Pro
100 105 110

Leu Glu Ile Tyr Tyr Gly Pro Ala Glu Ile Phe His Glu Pro Phe Leu
115 120 125

Pro Ser Ala Pro Val Ser Gly Phe Val Thr Gly Asp Gln Gly Ile Gly
130 135 140

His Phe Val Arg Cys Val Pro Asp Thr Ala Lys Ala Met Ala Phe Tyr
145 150 155 160

Thr Glu Val Leu Gly Phe Val Leu Ser Asp Ile Ile Asp Ile Gln Met
165 170 175

Gly Pro Glu Thr Ser Val Pro Ala His Phe Leu His Cys Asn Gly Arg
180 185 190

His His Thr Ile Ala Leu Ala Ala Phe Pro Ile Pro Lys Arg Ile His
195 200 205

His Phe Met Leu Gln Ala Asn Thr Ile Asp Asp Val Gly Tyr Ala Phe
210 215 220

Asp Arg Leu Asp Ala Ala Gly Arg Ile Thr Ser Leu Leu Gly Arg His
225 230 235 240

Thr Asn Asp Gln Thr Leu Ser Phe Tyr Ala Asp Thr Pro Ser Pro Met
245 250 255

Ile Glu Val Glu Phe Gly Trp Gly Pro Arg Thr Val Asp Ser Ser Trp
260 265 270

Thr Val Ala Arg His Ser Arg Thr Ala Met Trp Gly His Lys Ser Val
275 280 285

Arg Gly Gln Arg
290
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<210> 47 

<211> 311 

<212> PRT 

<213> Acinetobacter sp.



<400> 47 

Met Glu Val Lys Ile Phe Asn Thr Gln Asp Val Gln Asp Phe Leu Arg
1 5 10 15

Val Ala Ser Gly Leu Glu Gln Glu Gly Gly Asn Pro Arg Val Lys Gln
20 25 30

Ile Ile His Arg Val Leu Ser Asp Leu Tyr Lys Ala Ile Glu Asp Leu
35 40 45

Asn Ile Thr Ser Asp Glu Tyr Trp Ala Gly Val Ala Tyr Leu Asn Gln
50 55 60

Leu Gly Ala Asn Gln Glu Ala Gly Leu Leu Ser Pro Gly Leu Gly Phe
65 70 75 80

Asp His Tyr Leu Asp Met Arg Met Asp Ala Glu Asp Ala Ala Leu Gly
85 90 95

Ile Glu Asn Ala Thr Pro Arg Thr Ile Glu Gly Pro Leu Tyr Val Ala
100 105 110

Gly Ala Pro Glu Ser Val Gly Tyr Ala Arg Met Asp Asp Gly Ser Asp
115 120 125

Pro Asn Gly His Thr Leu Ile Leu His Gly Thr Ile Phe Asp Ala Asp
130 135 140

Gly Lys Pro Leu Pro Asn Ala Lys Val Glu Ile Trp His Ala Asn Thr
145 150 155 160

Lys Gly Phe Tyr Ser His Phe Asp Pro Thr Gly Glu Gln Gln Ala Phe
165 170 175

Asn Met Arg Arg Ser Ile Ile Thr Asp Glu Asn Gly Gln Tyr Arg Val
180 185 190

Arg Thr Ile Leu Pro Ala Gly Tyr Gly Cys Pro Pro Glu Gly Pro Thr
195 200 205

Gln Gln Leu Leu Asn Gln Leu Gly Arg His Gly Asn Arg Pro Ala His
210 215 220

Ile His Tyr Phe Val Ser Ala Asp Gly His Arg Lys Leu Thr Thr Gln
225 230 235 240

Ile Asn Val Ala Gly Asp Pro Tyr Thr Tyr Asp Asp Phe Ala Tyr Ala
245 250 255

Thr Arg Glu Gly Leu Val Val Asp Ala Val Glu His Thr Asp Pro Glu
260 265 270

Ala Ile Lys Ala Asn Asp Val Glu Gly Pro Phe Ala Glu Met Val Phe
275 280 285

Asp Leu Lys Leu Thr Arg Leu Val Asp Gly Val Asp Asn Gln Val Val
290 295 300

Asp Arg Pro Arg Leu Ala Val
305 310



<210> 48 
<211> 414 
<212> PRT 

<213> Pseudomonas putida 
<400> 48 

Thr Thr Glu Thr Ile Gln Ser Asn Ala Asn Leu Ala Pro Leu Pro Pro
1 5 10 15

His Val Pro Glu His Leu Val Phe Asp Phe Asp Met Tyr Asn Pro Ser
20 25 30

Asn Leu Ser Ala Gly Val Gln Glu Ala Trp Ala Val Leu Gln Glu Ser
35 40 45

Asn Val Pro Asp Leu Val Trp Thr Arg Cys Asn Gly Gly His Trp Ile
50 55 60

Ala Thr Arg Gly Gln Leu Ile Arg Glu Ala Tyr Glu Asp Tyr Arg His
65 70 75 80

Phe Ser Ser Glu Cys Pro Phe Ile Pro Arg Glu Ala Gly Glu Ala Tyr
85 90 95


Asp 


Phe 


JL J. G 


Pro 


inr 


Ser 


1 v 16l 


Asp 


Pro 


Pro 


Glu 


Gin 


Arg 


Gin 


Phe 


Arg 








10 0 










105 










110 




Ala 


Leu 


Ala 


Asn


bin 


val 


v aj. 


Gly Met 


Pro 


Val 


Val 


Asp 


Lys 


Leu 


Glu 






115 










120 










125 








Asn 


Arg 


T1 D 
IXc 


C* "1 r-> 

bin 




Leu 


Ala 


Cys 


Ser 


Leu 


He 


Glu 


Ser 


Leu 


Arg 


Pro 




130 










135 










140 








Gin Gly 




Cys 


Asn 


Phe 


Thr 


Glu Asp 


Tyr 


Ala 


Glu 


Pro 


Phe 


Pro 


He 


145 










150 










155 










160 


Arg 


He 


Phe 


Met 


Leu 


Leu 


Ala 


Gly Leu 


Pro 


Glu 


Glu 


Asp 


He 


Pro 


His 










lea 
lob 










170 










175 




Leu 


Lys 


Tyr 


Leu 
18 0 


inr 


Asp 


Gin 


Met 


Thr 
185 


Arg 


Pro 


Asp 


Gly 


Ser 
190 


Met 


Thr 


Phe 


Ala 


Cl it 
LxXU 

195 


a j. a 


Lys 


Glu 


Ala 


Leu 
200 


Tyr 


Asp 


Tyr 


Leu 


He 
205 


Pro 


He 


He 


Glu 


Gin 
210 


Arg 


Arg 


Gin 


Lys 


Pro 
215 


Gly 


Thr 


Asp 


Ala 


He 
22 0 


Ser 


He 


Val 


Ala 


Asn 


Gly 




Val 


Asn 


Gly Arg 


Pro 


He 


Thr 


Ser 


Asp 


Glu 


Ala 


Lys 


Arg 


225 










230 










235 










240 


Met 


Cys 


Gly 


Leu 


Leu 


Leu 


Val 


Gly Gly 


Leu 


Asp 


Thr 


Val 


Val 


Asn 


Phe 










245 










250 










255 




Leu 


Ser 


File 


Ser 


Met 


Glu 


Phe 


Leu 


Ala 


Lys 


Ser 


Pro 


Glu 


His 


Arg 


Gin 








260 










265 










270 




Glu 


Leu 


j_ j_e 


Gin Arg 


Pro 


Glu 


Arg 


He 


Pro 


Ala 


Ala 


Cys 


Glu 


Glu 


Leu 






1 1 nr 










280 










285 








Leu 


Arg 


Arg 


Phe 


Ser 


Leu 


Val 


Ala 


Asp 


Gly Arg 


He 


Leu 


Thr 


Ser 


Asp 




290 










295 










300 








Tyr 


Glu 


rite 


His 


Gly 


Val 


Gin 


Leu 


Lys 


Lys 


Gly Asp 


Gin 


He 


Leu 


Leu 


305 










310 










315 










320 


Pro 


Gin 


Met 


Leu 


Ser 


Gly Leu Asp 


Glu 


Arg 


Glu 


Asn 


Ala 


Cys 


Pro 


Met 










325 










330 










335 




His 


Val 


Asp 


Phe 


Ser 


Arg 


Gin 


Lys 


Val 


Ser 


His 


Thr 


Thr 


Phe 


Gly 


His 








340 










345 










350 




Gly 


Ser 


His 


Leu 


Cys 


Leu 


Gly Gin 


His 


Leu 


Ala 


Arg 


Arg 


Glu 


He 


He 






355 










360 










365 








Val 


Thr 
370 


Leu 


Lys 


Glu 


Trp 


Leu 
375 


Thr 


Arg 


He 


Pro 


Asp 
380 


Phe 


Ser 


He 


Ala 


Pro 


Gly 


Ala 


Gin 


He 


Gin 


His 


Lys 


Ser 


Gly 


He 


Val 


Ser 


Gly 


Val 


Gin 


385 










390 










395 








400 


Ala 


Leu 


Pro 


Leu 


Val 
405 


Trp 


Asp 


Pro 


Ala 


Thr 
410 


Thr 


Lys 


Ala 


Val 







<210> 49
<211> 374
<212> PRT
<213> Equus caballus
<400> 49

Ser Thr Ala Gly Lys Val Ile Lys Cys Lys Ala Ala Val Leu Trp Glu
1 5 10 15

Glu Lys Lys Pro Phe Ser Ile Glu Glu Val Glu Val Ala Pro Pro Lys
20 25 30

Ala His Glu Val Arg Ile Lys Met Val Ala Thr Gly Ile Cys Arg Ser
35 40 45

Asp Asp His Val Val Ser Gly Thr Leu Val Thr Pro Leu Pro Val Ile
50 55 60

Ala Gly His Glu Ala Ala Gly Ile Val Glu Ser Ile Gly Glu Gly Val
65 70 75 80

Thr Thr Val Arg Pro Gly Asp Lys Val Ile Pro Leu Phe Thr Pro Gln
85 90 95

Cys Gly Lys Cys Arg Val Cys Lys His Pro Glu Gly Asn Phe Cys Leu
100 105 110

Lys Asn Asp Leu Ser Met Pro Arg Gly Thr Met Gln Asp Gly Thr Ser
115 120 125

Arg Phe Thr Cys Arg Gly Lys Pro Ile His His Phe Leu Gly Thr Ser
130 135 140

Thr Phe Ser Gln Tyr Thr Val Val Asp Glu Ile Ser Val Ala Lys Ile
145 150 155 160

Asp Ala Ala Ser Pro Leu Glu Lys Val Cys Leu Ile Gly Cys Gly Phe
165 170 175

Ser Thr Gly Tyr Gly Ser Ala Val Lys Val Ala Lys Val Thr Gln Gly
180 185 190

Ser Thr Cys Ala Val Phe Gly Leu Gly Gly Val Gly Leu Ser Val Ile
195 200 205

Met Gly Cys Lys Ala Ala Gly Ala Ala Arg Ile Ile Gly Val Asp Ile
210 215 220

Asn Lys Asp Lys Phe Ala Lys Ala Lys Glu Val Gly Ala Thr Glu Cys
225 230 235 240

Val Asn Pro Gln Asp Tyr Lys Lys Pro Ile Gln Glu Val Leu Thr Glu
245 250 255

Met Ser Asn Gly Gly Val Asp Phe Ser Phe Glu Val Ile Gly Arg Leu
260 265 270

Asp Thr Met Val Thr Ala Leu Ser Cys Cys Gln Glu Ala Tyr Gly Val
275 280 285

Ser Val Ile Val Gly Val Pro Pro Asp Ser Gln Asn Leu Ser Met Asn
290 295 300

Pro Met Leu Leu Leu Ser Gly Arg Thr Trp Lys Gly Ala Ile Phe Gly
305 310 315 320

Gly Phe Lys Ser Lys Asp Ser Val Pro Lys Leu Val Ala Asp Phe Met
325 330 335

Ala Lys Lys Phe Ala Leu Asp Pro Leu Ile Thr His Val Leu Pro Phe
340 345 350

Glu Lys Ile Asn Glu Gly Phe Asp Leu Leu Arg Ser Gly Glu Ser Ile
355 360 365

Arg Thr Ile Leu Thr Phe
370























<210> 50 
<211> 297 
<212> PRT 

<213> Escherichia coli 
<400> 50 

Met Ala Thr Asn Leu Arg Gly Val Met Ala Ala Leu Leu Thr Pro Phe
1 5 10 15

Asp Gln Gln Gln Ala Leu Asp Lys Ala Ser Leu Arg Arg Leu Val Gln
20 25 30

Phe Asn Ile Gln Gln Gly Ile Asp Gly Leu Tyr Val Gly Gly Ser Thr
35 40 45

Gly Glu Ala Phe Val Gln Ser Leu Ser Glu Arg Glu Gln Val Leu Glu
50 55 60

Ile Val Ala Glu Glu Gly Lys Gly Lys Ile Lys Leu Ile Ala His Val
65 70 75 80

Gly Cys Val Thr Thr Ala Glu Ser Gln Gln Leu Ala Ala Ser Ala Lys
85 90 95

Arg Tyr Gly Phe Asp Ala Val Ser Ala Val Thr Pro Phe Tyr Tyr Pro
100 105 110

Phe Ser Phe Glu Glu His Cys Asp His Tyr Arg Ala Ile Ile Asp Ser
115 120 125

Ala Asp Gly Leu Pro Met Val Val Tyr Asn Ile Pro Ala Leu Ser Gly
130 135 140

Val Lys Leu Thr Leu Asp Gln Ile Asn Thr Leu Val Thr Leu Pro Gly
145 150 155 160

Val Gly Ala Leu Lys Gln Thr Ser Gly Asp Leu Tyr Gln Met Glu Gln
165 170 175

Ile Arg Arg Glu His Pro Asp Leu Val Leu Tyr Asn Gly Tyr Asp Glu
180 185 190

Ile Phe Ala Ser Gly Leu Leu Ala Gly Ala Asp Gly Gly Ile Gly Ser
195 200 205

Thr Tyr Asn Ile Met Gly Trp Arg Tyr Gln Gly Ile Val Lys Ala Leu
210 215 220

Lys Glu Gly Asp Ile Gln Thr Ala Gln Lys Leu Gln Thr Glu Cys Asn
225 230 235 240

Lys Val Ile Asp Leu Leu Ile Lys Thr Gly Val Phe Arg Gly Leu Lys
245 250 255

Thr Val Leu His Tyr Met Asp Val Val Ser Val Pro Leu Cys Arg Lys
260 265 270

Pro Phe Gly Pro Val Asp Glu Lys Tyr Leu Pro Glu Leu Lys Ala Leu
275 280 285

Ala Gln Gln Leu Met Gln Glu Arg Gly
290 295

















<210> 51 

<211> 268 

<212> PRT 

<213> Salmonella typhimurium

<400> 51

Met Glu Arg Tyr Glu Asn Leu Phe Ala Gln Leu Asn Asp Arg Arg Glu
1 5 10 15

Gly Ala Phe Val Pro Phe Val Thr Leu Gly Asp Pro Gly Ile Glu Gln
20 25 30

Ser Leu Lys Ile Ile Asp Thr Leu Ile Asp Ala Gly Ala Asp Ala Leu
35 40 45

Glu Leu Gly Val Pro Phe Ser Asp Pro Leu Ala Asp Gly Pro Thr Ile
50 55 60

Gln Asn Ala Asn Leu Arg Ala Phe Ala Ala Gly Val Thr Pro Ala Gln
65 70 75 80

Cys Phe Glu Met Leu Ala Leu Ile Arg Glu Lys His Pro Thr Ile Pro
85 90 95

Ile Gly Leu Leu Met Tyr Ala Asn Leu Val Phe Asn Asn Gly Ile Asp
100 105 110

Ala Phe Tyr Ala Arg Cys Glu Gln Val Gly Val Asp Ser Val Leu Val
115 120 125

Ala Asp Val Pro Val Glu Glu Ser Ala Pro Phe Arg Gln Ala Ala Leu
130 135 140

Arg His Asn Ile Ala Pro Ile Phe Ile Cys Pro Pro Asn Ala Asp Asp
145 150 155 160

Asp Leu Leu Arg Gln Val Ala Ser Tyr Gly Arg Gly Tyr Thr Tyr Leu
165 170 175

Leu Ser Arg Ser Gly Val Thr Gly Ala Glu Asn Arg Gly Ala Leu Pro
180 185 190

Leu His His Leu Ile Glu Lys Leu Lys Glu Tyr His Ala Ala Pro Ala
195 200 205

Leu Gln Gly Phe Gly Ile Ser Ser Pro Glu Gln Val Ser Ala Ala Val
210 215 220

Arg Ala Gly Ala Ala Gly Ala Ile Ser Gly Ser Ala Ile Val Lys Ile
225 230 235 240

Ile Glu Lys Asn Leu Ala Ser Pro Lys Gln Met Leu Ala Glu Leu Arg
245 250 255

Ser Phe Val Ser Ala Met Lys Ala Ala Ser Arg Ala
260 265



<210> 52 

<211> 393 

<212> PRT 

<213> Actinoplanes missouriensis 



<400> 52 

Ser Val Gln Ala Thr Arg Glu Asp Lys Phe Ser Phe Gly Leu Trp Thr
1 5 10 15

Val Gly Trp Gln Ala Arg Asp Ala Phe Gly Asp Ala Thr Arg Thr Ala
20 25 30

Leu Asp Pro Val Glu Ala Val His Lys Leu Ala Glu Ile Gly Ala Tyr
35 40 45

Gly Ile Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Asp Ala
50 55 60

Gln Thr Arg Asp Gly Ile Ile Ala Gly Phe Lys Lys Ala Leu Asp Glu
65 70 75 80

Thr Gly Leu Ile Val Pro Met Val Thr Thr Asn Leu Phe Thr His Pro
85 90 95

Val Phe Lys Asp Gly Gly Phe Thr Ser Asn Asp Arg Ser Val Arg Arg
100 105 110

Tyr Ala Ile Arg Lys Val Leu Arg Gln Met Asp Leu Gly Ala Glu Leu
115 120 125

Gly Ala Lys Thr Leu Val Leu Trp Gly Gly Arg Glu Gly Ala Glu Tyr
130 135 140

Asp Ser Ala Lys Asp Val Ser Ala Ala Leu Asp Arg Tyr Arg Glu Ala
145 150 155 160

Leu Asn Leu Leu Ala Gln Tyr Ser Glu Asp Arg Gly Tyr Gly Leu Arg
165 170 175

Phe Ala Ile Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Leu Leu
180 185 190

Pro Thr Ala Gly His Ala Ile Ala Phe Val Gln Glu Leu Glu Arg Pro
195 200 205

Glu Leu Phe Gly Ile Asn Pro Glu Thr Gly Asn Glu Gln Met Ser Asn
210 215 220

Leu Asn Phe Thr Gln Gly Ile Ala Gln Ala Leu Trp His Lys Lys Leu
225 230 235 240

Phe His Ile Asp Leu Asn Gly Gln His Gly Pro Lys Phe Asp Gln Asp
245 250 255

Leu Val Phe Gly His Gly Asp Leu Leu Asn Ala Phe Ser Leu Val Asp
260 265 270

Leu Leu Glu Asn Gly Pro Asp Gly Ala Pro Ala Tyr Asp Gly Pro Arg
275 280 285

His Phe Asp Tyr Lys Pro Ser Arg Thr Glu Asp Tyr Asp Gly Val Trp
290 295 300

Glu Ser Ala Lys Ala Asn Ile Arg Met Tyr Leu Leu Leu Lys Glu Arg
305 310 315 320

Ala Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Glu Ala Leu Ala Ala
325 330 335

Ser Lys Val Ala Glu Leu Lys Thr Pro Thr Leu Asn Pro Gly Glu Gly
340 345 350

Tyr Ala Glu Leu Leu Ala Asp Arg Ser Ala Phe Glu Asp Tyr Asp Ala
355 360 365

Asp Ala Val Gly Ala Lys Gly Phe Gly Phe Val Lys Leu Asn Gln Leu
370 375 380

Ala Ile Glu His Leu Leu Gly Ala Arg
385 390



<210> 53 

<211> 348 

<212> PRT 

<213> Bacteriophage T7 
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<400> 53 



Val Asn Ile Lys Thr Asn Pro Phe Lys Ala Val Ser Phe Val Glu Ser
1 5 10 15

Ala Ile Lys Lys Ala Leu Asp Asn Ala Gly Tyr Leu Ile Ala Glu Ile
20 25 30

Lys Tyr Asp Gly Val Arg Gly Asn Ile Cys Val Asp Asn Thr Ala Asn
35 40 45

Ser Tyr Trp Leu Ser Arg Val Ser Lys Thr Ile Pro Ala Leu Glu His
50 55 60

Leu Asn Gly Phe Asp Val Arg Trp Lys Arg Leu Leu Asn Asp Asp Arg
65 70 75 80

Cys Phe Tyr Lys Asp Gly Phe Met Leu Asp Gly Glu Leu Met Val Lys
85 90 95

Gly Val Asp Phe Asn Thr Gly Ser Gly Leu Leu Arg Thr Lys Trp Thr
100 105 110

Asp Thr Lys Asn Gln Glu Phe His Glu Glu Leu Phe Val Glu Pro Ile
115 120 125

Arg Lys Lys Asp Lys Val Pro Phe Lys Leu His Thr Gly His Leu His
130 135 140

Ile Lys Leu Tyr Ala Ile Leu Pro Leu His Ile Val Glu Ser Gly Glu
145 150 155 160

Asp Cys Asp Val Met Thr Leu Leu Met Gln Glu His Val Lys Asn Met
165 170 175

Leu Pro Leu Leu Gln Glu Tyr Phe Pro Glu Ile Glu Trp Gln Ala Ala
180 185 190

Glu Ser Tyr Glu Val Tyr Asp Met Val Glu Leu Gln Gln Leu Tyr Glu
195 200 205

Gln Lys Arg Ala Glu Gly His Glu Gly Leu Ile Val Lys Asp Pro Met
210 215 220

Cys Ile Tyr Lys Arg Gly Lys Lys Ser Gly Trp Trp Lys Met Lys Pro
225 230 235 240

Glu Asn Glu Ala Asp Gly Ile Ile Gln Gly Leu Val Trp Gly Thr Lys
245 250 255

Gly Leu Ala Asn Glu Gly Lys Val Ile Gly Phe Glu Val Leu Leu Glu
260 265 270

Ser Gly Arg Leu Val Asn Ala Thr Asn Ile Ser Arg Ala Leu Met Asp
275 280 285

Glu Phe Thr Glu Thr Val Lys Glu Ala Thr Leu Ser Gln Trp Gly Phe
290 295 300

Phe Ser Pro Tyr Gly Ile Gly Asp Asn Asp Ala Cys Thr Ile Asn Pro
305 310 315 320

Tyr Asp Gly Trp Ala Cys Gln Ile Ser Tyr Met Glu Glu Thr Pro Asp
325 330 335

Gly Ser Leu Arg His Pro Ser Phe Val Met Phe Arg
340 345



<210> 54
<211> 42
<212> DNA
<213> artificial sequence
<220>
<223> binding site for restr1
<220>
<221> CDS
<222> (2)..(40)
<223>



<400> 54 

g gtg gta tca gca ggc cac tgc tac aag tcc cgc atc cag gt 42
Val Val Ser Ala Gly His Cys Tyr Lys Ser Arg Ile Gln
1 5 10






<210> 55
<211> 13
<212> PRT
<213> artificial sequence
<220>
<223> binding site for restr1
<400> 55

Val Val Ser Ala Gly His Cys Tyr Lys Ser Arg Ile Gln
1 5 10



<210> 56 

<211> 42 

<212> DNA 

<213> artificial sequence 
<220> 

<223> forward primer restr1

<400> 56 

ggtggtatcc gcgggccact gctacaagtc ccggatccag gt 42 



<210> 57
<211> 42
<212> DNA
<213> artificial sequence
<220>
<223> reverse primer restr2
<400> 57



acctggatcc gggacttgta gcagtggccc gcggatacca cc 42 



<210> 58 

<211> 50 

<212> DNA 

<213> artificial sequence 
<220> 

<223> binding site for restr3 and restr4 
<220> 

<221> CDS 

<222> (3)..(50)
<223>

<400> 58

cc act ggc acg aag tgc ctc atc tct ggc tgg ggc aac act gcg agc 47
Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn Thr Ala Ser
1 5 10 15

tct 50
Ser



<210> 59 

<211> 16 

<212> PRT 

<213> artificial sequence 
<220> 

<223> binding site for restr3 and restr4 



<400> 59 




Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn Thr Ala Ser Ser
1 5 10 15


<210> 60
<211> 50
<212> DNA
<213> artificial sequence
<220>
<223> forward primer restr3
<400> 60




ccactggcac gaagtgcctc atctctggct 


qqgacaacac 


<210> 61
<211> 50
<212> DNA
<213> artificial sequence
<220>
<223> reverse primer restr4
<400> 61




agagctagca gtgttgcccc agccagagat 


cracrcrcacttQ 

ZD Z) ZD ZD 


<210> 62
<211> 30
<212> DNA
<213> artificial sequence
<220>
<223> primer puc-forward
<400> 62




ggggtacccc accaccatga atccactcct 




<210> 63
<211> 30
<212> DNA
<213> artificial sequence
<220>
<223> primer puc-reverse
<400> 63




cgggatccgg tatagagact gaagagatac 




<210> 64
<211> 39
<212> DNA
<213> artificial sequence
<220>
<223> oligox-SDR1f
<220>
<221> misc_feature
<222> (14)..(31)
<223> any nucleotide
<220>
<221> misc_feature
<222> (14)..(31)
<223> any nucleotide or amino acid residue
<220>
<221> CDS
<222> (2)..(37)
<223>

<400> 64

g ggc cac tgc tac nnn nnn nnn nnn nnn nnn aag tcc cg 39
Gly His Cys Tyr Xaa Xaa Xaa Xaa Xaa Xaa Lys Ser
1 5 10


<210> 65
<211> 12
<212> PRT
<213> artificial sequence
<220>
<221> misc_feature
<222> (5)..(5)
<223> The 'Xaa' at location 5 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (6)..(6)
<223> The 'Xaa' at location 6 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (7)..(7)
<223> The 'Xaa' at location 7 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (8)..(8)
<223> The 'Xaa' at location 8 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (9)..(9)
<223> The 'Xaa' at location 9 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (10)..(10)
<223> The 'Xaa' at location 10 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<223> oligox-SDR1f
<220>
<221> misc_feature
<222> (14)..(31)
<223> any nucleotide
<220>
<221> misc_feature
<222> (14)..(31)
<223> any nucleotide or amino acid residue
<400> 65

Gly His Cys Tyr Xaa Xaa Xaa Xaa Xaa Xaa Lys Ser
1 5 10



<210> 66
<211> 45
<212> DNA
<213> artificial sequence
<220>
<223> oligox-SDR1r
<220>
<221> misc_feature
<222> (16)..(33)
<223> any nucleotide
<400> 66



cgcccggtga cgatgnnnnn nnnnnnnnnn nnnttcaggg cctag 45 



<210> 67 
<211> 47 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligox-SDR2f 
<220> 

<221> CDS
<222> (2)..(46)
<223>
<220>

<221> misc_feature
<222> (29)..(43)

<223> any nucleotide or amino acid residue
<400> 67

c aag tgc ctc atc tct ggc tgg ggc aac nnn nnn nnn nnn nnn act g 47
Lys Cys Leu Ile Ser Gly Trp Gly Asn Xaa Xaa Xaa Xaa Xaa Thr
1 5 10 15



<210> 68 
<211> 15 
<212> PRT 

<213> artificial sequence 
<220> 

<221> misc_feature
<222> (10)..(10)
<223> The 'Xaa' at location 10 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (11)..(11)
<223> The 'Xaa' at location 11 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (12)..(12)
<223> The 'Xaa' at location 12 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (13)..(13)
<223> The 'Xaa' at location 13 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<221> misc_feature
<222> (14)..(14)
<223> The 'Xaa' at location 14 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.
<220>
<223> oligox-SDR2f
<220>
<221> misc_feature
<222> (29)..(43)
<223> any nucleotide or amino acid residue
<400> 68

Lys Cys Leu Ile Ser Gly Trp Gly Asn Xaa Xaa Xaa Xaa Xaa Thr
1 5 10 15



<210> 69
<211> 55
<212> DNA
<213> artificial sequence
<220>
<223> oligox-SDR2r
<220>
<221> misc_feature
<222> (33)..(47)
<223> any base
<220>
<221> misc_feature
<222> (33)..(47)
<223> any nucleotide
<400> 69

catggttcac ggagtagaga ccgaccccgt tgnnnnnnnn nnnnnnntga cgatc 55



<210> 70
<211> 59
<212> DNA
<213> artificial sequence
<220>
<223> primer SDR1-mutnnb-forward
<220>
<221> misc_feature
<222> (24)..(40)
<223> N=A, C, G, T; B=C, G, T; V=A, C, G
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<400> 70 

tggtatccgc gggccactgc tacnnbnnbn nbnnbnnbnn baagtcccgg atccaggtg 59 

<210> 71 

<211> 52 

<212> DNA 

<213> artificial sequence 
<220> 

<223> primer SDR2-mutnnb-reverse
<220>

<221> misc_feature

<222> (20)..(33)

<223> N=A, C, G, T; B=C, G, T; V=A, C, G

<400> 71 

ggcgccagag ctagcagtvn nvnnvnnvnn vnngttgccc cagccagaga tg 52 



<210> 72 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant g SDR1 

<400> 72 

Ala Phe Phe Asn Gly Asp 



<210> 73 

<211> 5 

<212> PRT 

<213> artificial sequence 



<220> 

<223> variant g SDR2 

<400> 73 

Arg Lys Asp Pro Trp 
1 5 

<210> 74 

<211> 234 

<212> PRT 

<213> artificial sequence 



<220>
<223> artificial sequence
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<400> 74 

Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1 5 10 15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
20 25 30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Ala Ala Phe Asn Gly Lys
35 40 45

Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu Glu
50 55 60

Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro Gln
65 70 75 80

Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Ser
85 90 95

Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro Thr
100 105 110

Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn
115 120 125

Arg Lys Asp Phe Trp Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp Glu
130 135 140

Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu Ala
145 150 155 160

Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe Leu
165 170 175

Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val Val
180 185 190

Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys Ala
195 200 205

Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val Lys
210 215 220

Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225 230



<210> 75 

<211> 234 

<212> PRT 

<213> artificial sequence 
<220> 

<223> artificial sequence 

<400> 75 

Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Ala Ala Phe Asn Gly Lys
        35                  40                  45

Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Gly Val Leu Glu
    50                  55                  60

Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro Gln
65                  70                  75                  80

Tyr Asp Trp Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Ser
                85                  90                  95

Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro Thr
            100                 105                 110

Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn
        115                 120                 125

Arg Lys Asp Phe Trp Thr Ala Ser Ser Gly Ala Asp Phe Pro Asp Glu
    130                 135                 140

Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Thr Lys Cys Glu Ala
145                 150                 155                 160

Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe Leu
                165                 170                 175

Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val Val
            180                 185                 190

Arg Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys Ala
        195                 200                 205

Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val Lys
    210                 215                 220

Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230



<210> 76 

<211> 12 

<212> PRT 

<213> artificial sequence 
<220> 

<223> substrate A 



<400> 76 

Leu Leu Trp Leu Gly Arg Val Val Gly Gly Pro Val 
1               5                   10



<210> 77 

<211> 12 

<212> PRT 

<213> artificial sequence 
<220>

<223> substrate B 
<400> 77 

Lys Lys Trp Leu Gly Arg Val Pro Gly Gly Pro Val 
1               5                   10



<210> 78

<211> 6

<212> PRT

<213> artificial sequence

<220>

<223> variant1 SDR1

<400> 78

Asp Ala Val Gly Arg Asp
1               5


<210> 79

<211> 6

<212> PRT

<213> artificial sequence

<220>

<223> variant2 SDR1

<400> 79

Asn Gly Arg Asp Leu Glu
1               5


<210> 80

<211> 6

<212> PRT

<213> artificial sequence

<220>

<223> variant3 SDR1

<400> 80

Gly Phe Val Met Phe Asn
1               5


<210> 81

<211> 5

<212> PRT

<213> artificial sequence

<220>

<223> variant1 SDR2

<400> 81

Arg Val His Pro Ser
1               5


<210> 82

<211> 5

<212> PRT

<213> artificial sequence

<220>

<223> variant2 SDR2

<400> 82

Val Arg Gly Thr Trp
1               5

<210> 83 

<211> 5 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant3 SDR2 

<400> 83 

Arg Ser Pro Leu Thr 



<210> 84 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant a SDR1 

<400> 84 

Arg Pro Trp Asp Pro Ser 



<210> 85 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant b SDR1

<400> 85 

Gly Phe Val Met Phe Asn 



<210> 86 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant c SDR1 

<400> 86 

Glu Ile Ala Asn Arg Glu



<210> 87 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant d SDR1 



<400> 87 

Lys Ala Val Val Gly Thr



<210> 88

<211> 6

<212> PRT

<213> artificial sequence
<220>

<223> variant e SDR1

<400> 88

Val Asn Ile Met Ala Ala



<210> 89 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant f SDR1 

<400> 89 

Ala Ala Phe Asn Gly Asp 



<210> 90 

<211> 5 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant a SDR2 

<400> 90 

Val His Pro Thr Ser 
1 5 



<210> 91 

<211> 5 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant b SDR2 

<400> 91 

Arg Ser Pro Leu Thr 



<210> 92 

<211> 5 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant c SDR2



<400> 92 

Arg Gly Ala Arg Thr
1               5


<210> 93

<211> 5

<212> PRT

<213> artificial sequence

<220>

<223> variant d SDR2

<400> 93

Arg Thr Pro Ile Ser
1               5


<210> 94

<211> 5

<212> PRT

<213> artificial sequence

<220>

<223> variant e SDR2

<400> 94

Thr Thr Ala Arg Lys
1               5


<210> 95

<211> 5

<212> PRT

<213> artificial sequence

<220>

<223> variant f SDR2

<400> 95

Arg Lys Asp Phe Trp
1               5


<210> 96

<211> 157

<212> PRT

<213> Homo sapiens

<400> 96

Val Arg Ser Ser Ser Arg Thr Pro Ser Asp Lys Pro Val Ala His Val
1               5                   10                  15

Val Ala Asn Pro Gln Ala Glu Gly Gln Leu Gln Trp Leu Asn Arg Arg
            20                  25                  30

Ala Asn Ala Leu Leu Ala Asn Gly Val Glu Leu Arg Asp Asn Gln Leu
        35                  40                  45

Val Val Pro Ser Glu Gly Leu Tyr Leu Ile Tyr Ser Gln Val Leu Phe
    50                  55                  60

Lys Gly Gln Gly Cys Pro Ser Thr His Val Leu Leu Thr His Thr Ile
65                  70                  75                  80

Ser Arg Ile Ala Val Ser Tyr Gln Thr Lys Val Asn Leu Leu Ser Ala
                85                  90                  95

Ile Lys Ser Pro Cys Gln Arg Glu Thr Pro Glu Gly Ala Glu Ala Lys
            100                 105                 110

Pro Trp Tyr Glu Pro Ile Tyr Leu Gly Gly Val Phe Gln Leu Glu Lys
        115                 120                 125

Gly Asp Arg Leu Ser Ala Glu Ile Asn Arg Pro Asp Tyr Leu Leu Phe
    130                 135                 140

Ala Glu Ser Gly Gln Val Tyr Phe Gly Ile Ile Ala Leu
145                 150                 155


<210> 97

<211> 306 

<212> PRT 

<213> Homo sapiens 



<400> 97 



Ile Trp Glu Leu Lys Lys Asp Val Tyr Val Val Glu Leu Asp Trp Tyr
1               5                   10                  15

Pro Asp Ala Pro Gly Glu Met Val Val Leu Thr Cys Asp Thr Pro Glu
            20                  25                  30

Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln Ser Ser Glu Val Leu Gly
        35                  40                  45

Ser Gly Lys Thr Leu Thr Ile Gln Val Lys Glu Phe Gly Asp Ala Gly
    50                  55                  60

Gln Tyr Thr Cys His Lys Gly Gly Glu Val Leu Ser His Ser Leu Leu
65                  70                  75                  80

Leu Leu His Lys Lys Glu Asp Gly Ile Trp Ser Thr Asp Ile Leu Lys
                85                  90                  95

Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe Leu Arg Cys Glu Ala Lys
            100                 105                 110

Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp Leu Thr Thr Ile Ser Thr
        115                 120                 125

Asp Leu Thr Phe Ser Val Lys Ser Ser Arg Gly Ser Ser Asp Pro Gln
    130                 135                 140

Gly Val Thr Cys Gly Ala Ala Thr Leu Ser Ala Glu Arg Val Arg Gly
145                 150                 155                 160

Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu Cys Gln Glu Asp Ser Ala
                165                 170                 175

Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile Glu Val Met Val Asp Ala
            180                 185                 190

Val His Lys Leu Lys Tyr Glu Asn Tyr Thr Ser Ser Phe Phe Ile Arg
        195                 200                 205

Asp Ile Ile Lys Pro Asp Pro Pro Lys Asn Leu Gln Leu Lys Pro Leu
    210                 215                 220

Lys Asn Ser Arg Gln Val Glu Val Ser Trp Glu Tyr Pro Asp Thr Trp
225                 230                 235                 240

Ser Thr Pro His Ser Tyr Phe Ser Leu Thr Phe Cys Val Gln Val Gln
                245                 250                 255

Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg Val Phe Thr Asp Lys Thr
            260                 265                 270

Ser Ala Thr Val Ile Cys Arg Lys Asn Ala Ser Ile Ser Val Arg Ala
        275                 280                 285

Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser Glu Trp Ala Ser Val Pro
    290                 295                 300

Cys Ser
305



<210> 98 

<211> 157 

<212> PRT 

<213> Homo sapiens 

<400> 98 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

Ser Val Lys Cys Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155



<210> 99 

<211> 133 

<212> PRT 

<213> Homo sapiens 

<400> 99 

Ala Pro Thr Ser Ser Ser Thr Lys Lys Thr Gln Leu Gln Leu Glu His
1               5                   10                  15

Leu Leu Leu Asp Leu Gln Met Ile Leu Asn Gly Ile Asn Asn Tyr Lys
            20                  25                  30

Asn Pro Lys Leu Thr Arg Met Leu Thr Phe Lys Phe Tyr Met Pro Lys
        35                  40                  45

Lys Ala Thr Glu Leu Lys His Leu Gln Cys Leu Glu Glu Glu Leu Lys
    50                  55                  60

Pro Leu Glu Glu Val Leu Asn Leu Ala Gln Ser Lys Asn Phe His Leu
65                  70                  75                  80

Arg Pro Arg Asp Leu Ile Ser Asn Ile Asn Val Ile Val Leu Glu Leu
                85                  90                  95

Lys Gly Ser Glu Thr Thr Phe Met Cys Glu Tyr Ala Asp Glu Thr Ala
            100                 105                 110

Thr Ile Val Glu Phe Leu Asn Arg Trp Ile Thr Phe Cys Gln Ser Ile
        115                 120                 125

Ile Ser Thr Leu Thr
    130



<210> 100 

<211> 72 

<212> PRT 

<213> Homo sapiens 

<400> 100 

Ser Ala Lys Glu Leu Arg Cys Gln Cys Ile Lys Thr Tyr Ser Lys Pro
1               5                   10                  15

Phe His Pro Lys Phe Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro
            20                  25                  30

His Cys Ala Asn Thr Glu Ile Ile Val Lys Leu Ser Asp Gly Arg Glu
        35                  40                  45

Leu Cys Leu Asp Pro Lys Glu Asn Trp Val Gln Arg Val Val Glu Lys
    50                  55                  60

Phe Leu Lys Arg Ala Glu Asn Ser
65                  70



<210> 101 

<211> 74 

<212> PRT 

<213> Homo sapiens

<400> 101

Gly Pro Ala Ser Val Pro Thr Thr Cys Cys Phe Asn Leu Ala Asn Arg
1               5                   10                  15

Lys Ile Pro Leu Gln Arg Leu Glu Ser Tyr Arg Arg Ile Thr Ser Gly
            20                  25                  30

Lys Cys Pro Gln Lys Ala Val Ile Phe Lys Thr Lys Leu Ala Lys Asp
        35                  40                  45

Ile Cys Ala Asp Pro Lys Lys Lys Trp Val Gln Asp Ser Met Lys Tyr
    50                  55                  60

Leu Asp Gln Lys Ser Pro Thr Pro Lys Pro
65                  70






















<210> 102

<211> 76

<212> PRT

<213> Homo sapiens

<400> 102

Gln Pro Asp Ala Ile Asn Ala Pro Val Thr Cys Cys Tyr Asn Phe Thr
1               5                   10                  15

Asn Arg Lys Ile Ser Val Gln Arg Leu Ala Ser Tyr Arg Arg Ile Thr
            20                  25                  30

Ser Ser Lys Cys Pro Lys Glu Ala Val Ile Phe Lys Thr Ile Val Ala
        35                  40                  45

Lys Glu Ile Cys Ala Asp Pro Lys Gln Lys Trp Val Gln Asp Ser Met
    50                  55                  60

Asp His Leu Asp Lys Gln Thr Gln Thr Pro Lys Thr
65                  70                  75












<210> 103

<211> 206

<212> PRT

<213> Homo sapiens

<400> 103

Ala Pro Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys
1               5                   10                  15

Phe Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu
            20                  25                  30

Val Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys
        35                  40                  45

Pro Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu
    50                  55                  60

Gly Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile
65                  70                  75                  80

Met Arg Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe
                85                  90                  95

Leu Gln His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg
            100                 105                 110

Gln Glu Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gln Lys Arg Lys
        115                 120                 125

Arg Lys Lys Ser Arg Tyr Lys Ser Trp Ser Val Tyr Val Gly Ala Arg
    130                 135                 140

Cys Cys Leu Met Pro Trp Ser Leu Pro Gly Pro His Pro Cys Gly Pro
145                 150                 155                 160

Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln Asp Pro Gln Thr Cys
                165                 170                 175

Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu
            180                 185                 190

Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg
        195                 200                 205


<210> 104

<211> 112 

<212> PRT 

<213> Homo sapiens 



<400> 104 



Ala Leu Asp Thr Asn Tyr Cys Phe Ser Ser Thr Glu Lys Asn Cys Cys
1               5                   10                  15

Val Arg Gln Leu Tyr Ile Asp Phe Arg Lys Asp Leu Gly Trp Lys Trp
            20                  25                  30

Ile His Glu Pro Lys Gly Tyr His Ala Asn Phe Cys Leu Gly Pro Cys
        35                  40                  45

Pro Tyr Ile Trp Ser Leu Asp Thr Gln Tyr Ser Lys Val Leu Ala Leu
    50                  55                  60

Tyr Asn Gln His Asn Pro Gly Ala Ser Ala Ala Pro Cys Cys Val Pro
65                  70                  75                  80

Gln Ala Leu Glu Pro Leu Pro Ile Val Tyr Tyr Val Gly Arg Lys Pro
                85                  90                  95

Lys Val Glu Gln Leu Ser Asn Met Ile Val Arg Ser Cys Lys Cys Ser
            100                 105                 110



<210> 105 

<211> 30 

<212> PRT 

<213> Homo sapiens 



<400> 105 

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr
1               5                   10                  15

Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr
            20                  25                  30



<210> 106 

<211> 21 

<212> PRT 

<213> Homo sapiens 



<400> 106 

Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu
1               5                   10                  15

Glu Asn Tyr Cys Asn
            20



<210> 107 

<211> 28 

<212> PRT 

<213> Homo sapiens 



<400> 107 

Gly Ser Ser Phe Leu Ser Pro Glu His Gln Arg Val Gln Gln Arg Lys
1               5                   10                  15

Glu Ser Lys Lys Pro Pro Ala Lys Leu Gln Pro Arg
            20                  25



<210> 108

<211> 9

<212> PRT

<213> Homo sapiens

<400> 108

Arg Val Tyr Ile His Pro Phe His Leu
1               5
















<210> 109

<211> 114

<212> PRT

<213> Homo sapiens

<400> 109

Pro Met Phe Ile Val Asn Thr Asn Val Pro Arg Ala Ser Val Pro Asp
1               5                   10                  15

Gly Phe Leu Ser Glu Leu Thr Gln Gln Leu Ala Gln Ala Thr Gly Lys
            20                  25                  30

Pro Pro Gln Tyr Ile Ala Val His Val Val Pro Asp Gln Leu Met Ala
        35                  40                  45

Phe Gly Gly Ser Ser Glu Pro Cys Ala Leu Cys Ser Leu His Ser Ile
    50                  55                  60

Gly Lys Ile Gly Gly Ala Gln Asn Arg Ser Tyr Ser Lys Leu Leu Cys
65                  70                  75                  80

Gly Leu Leu Ala Glu Arg Leu Arg Ile Ser Pro Asp Arg Val Tyr Ile
                85                  90                  95

Asn Tyr Tyr Asp Met Asn Ala Ala Asn Val Gly Trp Asn Asn Ser Thr
            100                 105                 110

Phe Ala






























<210> 110

<211> 425

<212> PRT

<213> Homo sapiens

<400> 110

Met Gly Pro Arg Arg Leu Leu Leu Val Ala Ala Cys Phe Ser Leu Cys
1               5                   10                  15

Gly Pro Leu Leu Ser Ala Arg Thr Arg Ala Arg Arg Pro Glu Ser Lys
            20                  25                  30

Ala Thr Asn Ala Thr Leu Asp Pro Arg Ser Phe Leu Leu Arg Asn Pro
        35                  40                  45

Asn Asp Lys Tyr Glu Pro Phe Trp Glu Asp Glu Glu Lys Asn Glu Ser
    50                  55                  60

Gly Leu Thr Glu Tyr Arg Leu Val Ser Ile Asn Lys Ser Ser Pro Leu
65                  70                  75                  80

Gln Lys Gln Leu Pro Ala Phe Ile Ser Glu Asp Ala Ser Gly Tyr Leu
                85                  90                  95

Thr Ser Ser Trp Leu Thr Leu Phe Val Pro Ser Val Tyr Thr Gly Val
            100                 105                 110

Phe Val Val Ser Leu Pro Leu Asn Ile Met Ala Ile Val Val Phe Ile
        115                 120                 125

Leu Lys Met Lys Val Lys Lys Pro Ala Val Val Tyr Met Leu His Leu
    130                 135                 140

Ala Thr Ala Asp Val Leu Phe Val Ser Val Leu Pro Phe Lys Ile Ser
145                 150                 155                 160

Tyr Tyr Phe Ser Gly Ser Asp Trp Gln Phe Gly Ser Glu Leu Cys Arg
                165                 170                 175

Phe Val Thr Ala Ala Phe Tyr Cys Asn Met Tyr Ala Ser Ile Leu Leu
            180                 185                 190

Met Thr Val Ile Ser Ile Asp Arg Phe Leu Ala Val Val Tyr Pro Met
        195                 200                 205

Gln Ser Leu Ser Trp Arg Thr Leu Gly Arg Ala Ser Phe Thr Cys Leu
    210                 215                 220

Ala Ile Trp Ala Leu Ala Ile Ala Gly Val Val Pro Leu Leu Leu Lys
225                 230                 235                 240

Glu Gln Thr Ile Gln Val Pro Gly Leu Asn Ile Thr Thr Cys His Asp
                245                 250                 255

Val Leu Asn Glu Thr Leu Leu Glu Gly Tyr Tyr Ala Tyr Tyr Phe Ser
            260                 265                 270

Ala Phe Ser Ala Val Phe Phe Phe Val Pro Leu Ile Ile Ser Thr Val
        275                 280                 285

Cys Tyr Val Ser Ile Ile Arg Cys Leu Ser Ser Ser Ala Val Ala Asn
    290                 295                 300

Arg Ser Lys Lys Ser Arg Ala Leu Phe Leu Ser Ala Ala Val Phe Cys
305                 310                 315                 320

Ile Phe Ile Ile Cys Phe Gly Pro Thr Asn Val Leu Leu Ile Ala His
                325                 330                 335

Tyr Ser Phe Leu Ser His Thr Ser Thr Thr Glu Ala Ala Tyr Phe Ala
            340                 345                 350

Tyr Leu Leu Cys Val Cys Val Ser Ser Ile Ser Cys Cys Ile Asp Pro
        355                 360                 365

Leu Ile Tyr Tyr Tyr Ala Ser Ser Glu Cys Gln Arg Tyr Val Tyr Ser
    370                 375                 380

Ile Leu Cys Cys Lys Glu Ser Ser Asp Pro Ser Ser Tyr Asn Ser Ser
385                 390                 395                 400

Gly Gln Leu Met Ala Ser Lys Met Asp Thr Cys Ser Ser Asn Leu Asn
                405                 410                 415

Asn Ser Ile Tyr Lys Lys Leu Leu Thr
            420                 425



<210> 111 

<211> 397 

<212> PRT 

<213> Homo sapiens 



<400> 111 



Met Arg Ser Pro Ser Ala Ala Trp Leu Leu Gly Ala Ala Ile Leu Leu
1               5                   10                  15

Ala Ala Ser Leu Ser Cys Ser Gly Thr Ile Gln Gly Thr Asn Arg Ser
            20                  25                  30

Ser Lys Gly Arg Ser Leu Ile Gly Lys Val Asp Gly Thr Ser His Val
        35                  40                  45

Thr Gly Lys Gly Val Thr Val Glu Thr Val Phe Ser Val Asp Glu Phe
    50                  55                  60

Ser Ala Ser Val Leu Thr Gly Lys Leu Thr Thr Val Phe Leu Pro Ile
65                  70                  75                  80

Val Tyr Thr Ile Val Phe Val Val Gly Leu Pro Ser Asn Gly Met Ala
                85                  90                  95

Leu Trp Val Phe Leu Phe Arg Thr Lys Lys Lys His Pro Ala Val Ile
            100                 105                 110

Tyr Met Ala Asn Leu Ala Leu Ala Asp Leu Leu Ser Val Ile Trp Phe
        115                 120                 125

Pro Leu Lys Ile Ala Tyr His Ile His Gly Asn Asn Trp Ile Tyr Gly
    130                 135                 140

Glu Ala Leu Cys Asn Val Leu Ile Gly Phe Phe Tyr Gly Asn Met Tyr
145                 150                 155                 160

Cys Ser Ile Leu Phe Met Thr Cys Leu Ser Val Gln Arg Tyr Trp Val
                165                 170                 175

Ile Val Asn Pro Met Gly His Ser Arg Lys Lys Ala Asn Ile Ala Ile
            180                 185                 190

Gly Ile Ser Leu Ala Ile Trp Leu Leu Ile Leu Leu Val Thr Ile Pro
        195                 200                 205

Leu Tyr Val Val Lys Gln Thr Ile Phe Ile Pro Ala Leu Asn Ile Thr
    210                 215                 220

Thr Cys His Asp Val Leu Pro Glu Gln Leu Leu Val Gly Asp Met Phe
225                 230                 235                 240

Asn Tyr Phe Leu Ser Leu Ala Ile Gly Val Phe Leu Phe Pro Ala Phe
                245                 250                 255

Leu Thr Ala Ser Ala Tyr Val Leu Met Ile Arg Met Leu Arg Ser Ser
            260                 265                 270

Ala Met Asp Glu Asn Ser Glu Lys Lys Arg Lys Arg Ala Ile Lys Leu
        275                 280                 285

Ile Val Thr Val Leu Ala Met Tyr Leu Ile Cys Phe Thr Pro Ser Asn
    290                 295                 300

Leu Leu Leu Val Val His Tyr Phe Leu Ile Lys Ser Gln Gly Gln Ser
305                 310                 315                 320

His Val Tyr Ala Leu Tyr Ile Val Ala Leu Cys Leu Ser Thr Leu Asn
                325                 330                 335

Ser Cys Ile Asp Pro Phe Val Tyr Tyr Phe Val Ser His Asp Phe Arg
            340                 345                 350

Asp His Ala Lys Asn Ala Leu Leu Cys Arg Ser Val Arg Thr Val Lys
        355                 360                 365

Gln Met Gln Val Ser Leu Thr Ser Lys Lys His Ser Arg Lys Ser Ser
    370                 375                 380

Ser Tyr Ser Ser Ser Ser Thr Thr Val Lys Thr Ser Tyr
385                 390                 395



<210> 112 

<211> 153 

<212> PRT 

<213> Homo sapiens 

<400> 112

Ala Pro Val Arg Ser Leu Asn Cys Thr Leu Arg Asp Ser Gln Gln Lys
1               5                   10                  15

Ser Leu Val Met Ser Gly Pro Tyr Glu Leu Lys Ala Leu His Leu Gln
            20                  25                  30

Gly Gln Asp Met Glu Gln Gln Val Val Phe Ser Met Ser Phe Val Gln
        35                  40                  45

Gly Glu Glu Ser Asn Asp Lys Ile Pro Val Ala Leu Gly Leu Lys Glu
    50                  55                  60

Lys Asn Leu Tyr Leu Ser Cys Val Leu Lys Asp Asp Lys Pro Thr Leu
65                  70                  75                  80

Gln Leu Glu Ser Val Asp Pro Lys Asn Tyr Pro Lys Lys Lys Met Glu
                85                  90                  95

Lys Arg Phe Val Phe Asn Lys Ile Glu Ile Asn Asn Lys Leu Glu Phe
            100                 105                 110

Glu Ser Ala Gln Phe Pro Asn Trp Tyr Ile Ser Thr Ser Gln Ala Glu
        115                 120                 125

Asn Met Pro Val Phe Leu Gly Gly Thr Lys Gly Gly Gln Asp Ile Thr
    130                 135                 140

Asp Phe Thr Met Gln Phe Val Ser Ser
145                 150



<210> 113 

<211> 385 

<212> PRT 

<213> Homo sapiens 

<400> 113 

Met Trp Gly Arg Leu Leu Leu Trp Pro Leu Val Leu Gly Phe Ser Leu
1               5                   10                  15

Ser Gly Gly Thr Gln Thr Pro Ser Val Tyr Asp Glu Ser Gly Ser Thr
            20                  25                  30

Gly Gly Gly Asp Asp Ser Thr Pro Ser Ile Leu Pro Ala Pro Arg Gly
        35                  40                  45

Tyr Pro Gly Gln Val Cys Ala Asn Asp Ser Asp Thr Leu Glu Leu Pro
    50                  55                  60

Asp Ser Ser Arg Ala Leu Leu Leu Gly Trp Val Pro Thr Arg Leu Val
65                  70                  75                  80

Pro Ala Leu Tyr Gly Leu Val Leu Val Val Gly Leu Pro Ala Asn Gly
                85                  90                  95

Leu Ala Leu Trp Val Leu Ala Thr Gln Ala Pro Arg Leu Pro Ser Thr
            100                 105                 110

Met Leu Leu Met Asn Leu Ala Thr Ala Asp Leu Leu Leu Ala Leu Ala
        115                 120                 125

Leu Pro Pro Arg Ile Ala Tyr His Leu Arg Gly Gln Arg Trp Pro Phe
    130                 135                 140

Gly Glu Ala Ala Cys Arg Leu Ala Thr Ala Ala Leu Tyr Gly His Met
145                 150                 155                 160

Tyr Gly Ser Val Leu Leu Leu Ala Ala Val Ser Leu Asp Arg Tyr Leu
                165                 170                 175

Ala Leu Val His Pro Leu Arg Ala Arg Ala Leu Arg Gly Arg Arg Leu
            180                 185                 190

Ala Leu Gly Leu Cys Met Ala Ala Trp Leu Met Ala Ala Ala Leu Ala
        195                 200                 205

Leu Pro Leu Thr Leu Gln Arg Gln Thr Phe Arg Leu Ala Arg Ser Asp
    210                 215                 220

Arg Val Leu Cys His Asp Ala Leu Pro Leu Asp Ala Gln Ala Ser His
225                 230                 235                 240

Trp Gln Pro Ala Phe Thr Cys Leu Ala Leu Leu Gly Cys Phe Leu Pro
                245                 250                 255

Leu Leu Ala Met Leu Leu Cys Tyr Gly Ala Thr Leu His Thr Leu Ala
            260                 265                 270

Ala Ser Gly Arg Arg Tyr Gly His Ala Leu Arg Leu Thr Ala Val Val
        275                 280                 285

Leu Ala Ser Ala Val Ala Phe Phe Val Pro Ser Asn Leu Leu Leu Leu
    290                 295                 300

Leu His Tyr Ser Asp Pro Ser Pro Ser Ala Trp Gly Asn Leu Tyr Gly
305                 310                 315                 320

Ala Tyr Val Pro Ser Leu Ala Leu Ser Thr Leu Asn Ser Cys Val Asp
                325                 330                 335

Pro Phe Ile Tyr Tyr Tyr Val Ser Ala Glu Phe Arg Asp Lys Val Arg
            340                 345                 350

Ala Gly Leu Phe Gln Arg Ser Pro Gly Asp Thr Val Ala Ser Lys Ala
        355                 360                 365

Ser Ala Glu Gly Gly Ser Arg Gly Met Gly Thr His Ser Ser Leu Leu
    370                 375                 380

Gln
385



<210> 114 

<211> 1338 

<212> PRT 

<213> Homo sapiens 

<400> 114 

Met Val Ser Tyr Trp Asp Thr Gly Val Leu Leu Cys Ala Leu Leu Ser 

15 10 15 

Cys Leu Leu Leu Thr Gly Ser Ser Ser Gly Ser Lys Leu Lys Asp Pro 

20 25 30 

Glu Leu Ser Leu Lys Gly Thr Gin His lie Met Gin Ala Gly Gin Thr 

35 40 45 

Leu His Leu Gin Cys Arg Gly Glu Ala Ala His Lys Trp Ser Leu Pro 
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50 55 60 



Glu Met Val Ser Lys Glu Ser Glu Arg Leu Ser Ile Thr Lys Ser Ala 

65 70 75 80 

Cys Gly Arg Asn Gly Lys Gln Phe Cys Ser Thr Leu Thr Leu Asn Thr 

85 90 95 

Ala Gln Ala Asn His Thr Gly Phe Tyr Ser Cys Lys Tyr Leu Ala Val 

100 105 110 

Pro Thr Ser Lys Lys Lys Glu Thr Glu Ser Ala Ile Tyr Ile Phe Ile 

115 120 125 

Ser Asp Thr Gly Arg Pro Phe Val Glu Met Tyr Ser Glu Ile Pro Glu 

130 135 140 

Ile Ile His Met Thr Glu Gly Arg Glu Leu Val Ile Pro Cys Arg Val 

145 150 155 160 

Thr Ser Pro Asn Ile Thr Val Thr Leu Lys Lys Phe Pro Leu Asp Thr 

165 170 175 

Leu Ile Pro Asp Gly Lys Arg Ile Ile Trp Asp Ser Arg Lys Gly Phe 

180 185 190 

Ile Ile Ser Asn Ala Thr Tyr Lys Glu Ile Gly Leu Leu Thr Cys Glu 

195 200 205 








Ala Thr Val Asn Gly His Leu Tyr Lys Thr Asn Tyr Leu Thr His Arg 

210 215 220 

Gln Thr Asn Thr Ile Ile Asp Val Gln Ile Ser Thr Pro Arg Pro Val 

225 230 235 240 

Lys Leu Leu Arg Gly His Thr Leu Val Leu Asn Cys Thr Ala Thr Thr 

245 250 255 

Pro Leu Asn Thr Arg Val Gln Met Thr Trp Ser Tyr Pro Asp Glu Lys 

260 265 270 

Asn Lys Arg Ala Ser Val Arg Arg Arg Ile Asp Gln Ser Asn Ser His 

275 280 285 

Ala Asn Ile Phe Tyr Ser Val Leu Thr Ile Asp Lys Met Gln Asn Lys 

290 295 300 

Asp Lys Gly Leu Tyr Thr Cys Arg Val Arg Ser Gly Pro Ser Phe Lys 

305 310 315 320 

Ser Val Asn Thr Ser Val His Ile Tyr Asp Lys Ala Phe Ile Thr Val 

325 330 335 

Lys His Arg Lys Gln Gln Val Leu Glu Thr Val Ala Gly Lys Arg Ser 

340 345 350 






Tyr Arg Leu Ser Met Lys Val Lys Ala Phe Pro Ser Pro Glu Val Val 

355 360 365 

Trp Leu Lys Asp Gly Leu Pro Ala Thr Glu Lys Ser Ala Arg Tyr Leu 

370 375 380 

Thr Arg Gly Tyr Ser Leu Ile Ile Lys Asp Val Thr Glu Glu Asp Ala 

385 390 395 400 

Gly Asn Tyr Thr Ile Leu Leu Ser Ile Lys Gln Ser Asn Val Phe Lys 

405 410 415 

Asn Leu Thr Ala Thr Leu Ile Val Asn Val Lys Pro Gln Ile Tyr Glu 

420 425 430 

Lys Ala Val Ser Ser Phe Pro Asp Pro Ala Leu Tyr Pro Leu Gly Ser 

435 440 445 

Arg Gln Ile Leu Thr Cys Thr Ala Tyr Gly Ile Pro Gln Pro Thr Ile 

450 455 460 

Lys Trp Phe Trp His Pro Cys Asn His Asn His Ser Glu Ala Arg Cys 

465 470 475 480 

Asp Phe Cys Ser Asn Asn Glu Glu Ser Phe Ile Leu Asp Ala Asp Ser 

485 490 495 




Asn Met Gly Asn Arg Ile Glu Ser Ile Thr Gln Arg Met Ala Ile Ile 

500 505 510 

Glu Gly Lys Asn Lys Met Ala Ser Thr Leu Val Val Ala Asp Ser Arg 

515 520 525 

Ile Ser Gly Ile Tyr Ile Cys Ile Ala Ser Asn Lys Val Gly Thr Val 

530 535 540 

Gly Arg Asn Ile Ser Phe Tyr Ile Thr Asp Val Pro Asn Gly Phe His 

545 550 555 560 






Val Asn Leu Glu Lys Met Pro Thr Glu Gly Glu Asp Leu Lys Leu Ser 

565 570 575 

Cys Thr Val Asn Lys Phe Leu Tyr Arg Asp Val Thr Trp Ile Leu Leu 

580 585 590 

Arg Thr Val Asn Asn Arg Thr Met His Tyr Ser Ile Ser Lys Gln Lys 

595 600 605 

Met Ala Ile Thr Lys Glu His Ser Ile Thr Leu Asn Leu Thr Ile Met 

610 615 620 

Asn Val Ser Leu Gln Asp Ser Gly Thr Tyr Ala Cys Arg Ala Arg Asn 

625 630 635 640 

Val Tyr Thr Gly Glu Glu Ile Leu Gln Lys Lys Glu Ile Thr Ile Arg 

645 650 655 

Asp Gln Glu Ala Pro Tyr Leu Leu Arg Asn Leu Ser Asp His Thr Val 

660 665 670 

Ala Ile Ser Ser Ser Thr Thr Leu Asp Cys His Ala Asn Gly Val Pro 

675 680 685 

Glu Pro Gln Ile Thr Trp Phe Lys Asn Asn His Lys Ile Gln Gln Glu 

690 695 700 










Pro Gly Ile Ile Leu Gly Pro Gly Ser Ser Thr Leu Phe Ile Glu Arg 

705 710 715 720 

Val Thr Glu Glu Asp Glu Gly Val Tyr His Cys Lys Ala Thr Asn Gln 

725 730 735 

Lys Gly Ser Val Glu Ser Ser Ala Tyr Leu Thr Val Gln Gly Thr Ser 

740 745 750 

Asp Lys Ser Asn Leu Glu Leu Ile Thr Leu Thr Cys Thr Cys Val Ala 

755 760 765 

Ala Thr Leu Phe Trp Leu Leu Leu Thr Leu Leu Ile Arg Lys Met Lys 

770 775 780 

Arg Ser Ser Ser Glu Ile Lys Thr Asp Tyr Leu Ser Ile Ile Met Asp 

785 790 795 800 

Pro Asp Glu Val Pro Leu Asp Glu Gln Cys Glu Arg Leu Pro Tyr Asp 

805 810 815 

Ala Ser Lys Trp Glu Phe Ala Arg Glu Arg Leu Lys Leu Gly Lys Ser 

820 825 830 

Leu Gly Arg Gly Ala Phe Gly Lys Val Val Gln Ala Ser Ala Phe Gly 

835 840 845 








Ile Lys Lys Ser Pro Thr Cys Arg Thr Val Ala Val Lys Met Leu Lys 

850 855 860 

Glu Gly Ala Thr Ala Ser Glu Tyr Lys Ala Leu Met Thr Glu Leu Lys 

865 870 875 880 

Ile Leu Thr His Ile Gly His His Leu Asn Val Val Asn Leu Leu Gly 

885 890 895 

Ala Cys Thr Lys Gln Gly Gly Pro Leu Met Val Ile Val Glu Tyr Cys 

900 905 910 

Lys Tyr Gly Asn Leu Ser Asn Tyr Leu Lys Ser Lys Arg Asp Leu Phe 

915 920 925 

Phe Leu Asn Lys Asp Ala Ala Leu His Met Glu Pro Lys Lys Glu Lys 

930 935 940 

Met Glu Pro Gly Leu Glu Gln Gly Lys Lys Pro Arg Leu Asp Ser Val 

945 950 955 960 

Thr Ser Ser Glu Ser Phe Ala Ser Ser Gly Phe Gln Glu Asp Lys Ser 

965 970 975 

Leu Ser Asp Val Glu Glu Glu Glu Asp Ser Asp Gly Phe Tyr Lys Glu 

980 985 990 

Pro Ile Thr Met Glu Asp Leu Ile Ser Tyr Ser Phe Gln Val Ala Arg 

995 1000 1005 







Gly Met Glu Phe Leu Ser Ser Arg Lys Cys Ile His Arg Asp Leu 

1010 1015 1020 

Ala Ala Arg Asn Ile Leu Leu Ser Glu Asn Asn Val Val Lys Ile 

1025 1030 1035 

Cys Asp Phe Gly Leu Ala Arg Asp Ile Tyr Lys Asn Pro Asp Tyr 

1040 1045 1050 

Val Arg Lys Gly Asp Thr Arg Leu Pro Leu Lys Trp Met Ala Pro 

1055 1060 1065 








Glu Ser Ile Phe Asp Lys Ile Tyr Ser Thr Lys Ser Asp Val Trp 

1070 1075 1080 

Ser Tyr Gly Val Leu Leu Trp Glu Ile Phe Ser Leu Gly Gly Ser 

1085 1090 1095 

Pro Tyr Pro Gly Val Gln Met Asp Glu Asp Phe Cys Ser Arg Leu 

1100 1105 1110 

Arg Glu Gly Met Arg Met Arg Ala Pro Glu Tyr Ser Thr Pro Glu 

1115 1120 1125 

Ile Tyr Gln Ile Met Leu Asp Cys Trp His Arg Asp Pro Lys Glu 

1130 1135 1140 

Arg Pro Arg Phe Ala Glu Leu Val Glu Lys Leu Gly Asp Leu Leu 

1145 1150 1155 

Gln Ala Asn Val Gln Gln Asp Gly Lys Asp Tyr Ile Pro Ile Asn 

1160 1165 1170 

Ala Ile Leu Thr Gly Asn Ser Gly Phe Thr Tyr Ser Thr Pro Ala 

1175 1180 1185 

Phe Ser Glu Asp Phe Phe Lys Glu Ser Ile Ser Ala Pro Lys Phe 

1190 1195 1200 

Asn Ser Gly Ser Ser Asp Asp Val Arg Tyr Val Asn Ala Phe Lys 

1205 1210 1215 

Phe Met Ser Leu Glu Arg Ile Lys Thr Phe Glu Glu Leu Leu Pro 

1220 1225 1230 

Asn Ala Thr Ser Met Phe Asp Asp Tyr Gln Gly Asp Ser Ser Thr 

1235 1240 1245 

Leu Leu Ala Ser Pro Met Leu Lys Arg Phe Thr Trp Thr Asp Ser 

1250 1255 1260 

Lys Pro Lys Ala Ser Leu Lys Ile Asp Leu Arg Val Thr Ser Lys 

1265 1270 1275 

Ser Lys Glu Ser Gly Leu Ser Asp Val Ser Arg Pro Ser Phe Cys 

1280 1285 1290 

His Ser Ser Cys Gly His Val Ser Glu Gly Lys Arg Arg Phe Thr 

1295 1300 1305 

Tyr Asp His Ala Glu Leu Glu Arg Lys Ile Ala Cys Cys Ser Pro 

1310 1315 1320 

Pro Pro Asp Tyr Asn Ser Val Val Leu Tyr Ser Thr Pro Pro Ile 

1325 1330 1335 



<210> 115 

<211> 1356 

<212> PRT 

<213> Homo sapiens 



<400> 115 

Met Gln Ser Lys Val Leu Leu Ala Val Ala Leu Trp Leu Cys Val Glu 

1 5 10 15 

Thr Arg Ala Ala Ser Val Gly Leu Pro Ser Val Ser Leu Asp Leu Pro 

20 25 30 

Arg Leu Ser Ile Gln Lys Asp Ile Leu Thr Ile Lys Ala Asn Thr Thr 

35 40 45 

Leu Gln Ile Thr Cys Arg Gly Gln Arg Asp Leu Asp Trp Leu Trp Pro 

50 55 60 

Asn Asn Gln Ser Gly Ser Glu Gln Arg Val Glu Val Thr Glu Cys Ser 

65 70 75 80 

Asp Gly Leu Phe Cys Lys Thr Leu Thr Ile Pro Lys Val Ile Gly Asn 

85 90 95 

Asp Thr Gly Ala Tyr Lys Cys Phe Tyr Arg Glu Thr Asp Leu Ala Ser 

100 105 110 

Val Ile Tyr Val Tyr Val Gln Asp Tyr Arg Ser Pro Phe Ile Ala Ser 

115 120 125 

Val Ser Asp Gln His Gly Val Val Tyr Ile Thr Glu Asn Lys Asn Lys 

130 135 140 

Thr Val Val Ile Pro Cys Leu Gly Ser Ile Ser Asn Leu Asn Val Ser 

145 150 155 160 


Leu Cys Ala Arg Tyr Pro Glu Lys Arg Phe Val Pro Asp Gly Asn Arg 

165 170 175 

Ile Ser Trp Asp Ser Lys Lys Gly Phe Thr Ile Pro Ser Tyr Met Ile 

180 185 190 

Ser Tyr Ala Gly Met Val Phe Cys Glu Ala Lys Ile Asn Asp Glu Ser 

195 200 205 

Tyr Gln Ser Ile Met Tyr Ile Val Val Val Val Gly Tyr Arg Ile Tyr 

210 215 220 

Asp Val Val Leu Ser Pro Ser His Gly Ile Glu Leu Ser Val Gly Glu 

225 230 235 240 

Lys Leu Val Leu Asn Cys Thr Ala Arg Thr Glu Leu Asn Val Gly Ile 

245 250 255 

Asp Phe Asn Trp Glu Tyr Pro Ser Ser Lys His Gln His Lys Lys Leu 

260 265 270 

Val Asn Arg Asp Leu Lys Thr Gln Ser Gly Ser Glu Met Lys Lys Phe 

275 280 285 

Leu Ser Thr Leu Thr Ile Asp Gly Val Thr Arg Ser Asp Gln Gly Leu 

290 295 300 










Tyr Thr Cys Ala Ala Ser Ser Gly Leu Met Thr Lys Lys Asn Ser Thr 

305 310 315 320 

Phe Val Arg Val His Glu Lys Pro Phe Val Ala Phe Gly Ser Gly Met 

325 330 335 

Glu Ser Leu Val Glu Ala Thr Val Gly Glu Arg Val Arg Ile Pro Ala 

340 345 350 

Lys Tyr Leu Gly Tyr Pro Pro Pro Glu Ile Lys Trp Tyr Lys Asn Gly 

355 360 365 

Ile Pro Leu Glu Ser Asn His Thr Ile Lys Ala Gly His Val Leu Thr 

370 375 380 

Ile Met Glu Val Ser Glu Arg Asp Thr Gly Asn Tyr Thr Val Ile Leu 

385 390 395 400 

Thr Asn Pro Ile Ser Lys Glu Lys Gln Ser His Val Val Ser Leu Val 

405 410 415 

Val Tyr Val Pro Pro Gln Ile Gly Glu Lys Ser Leu Ile Ser Pro Val 

420 425 430 

Asp Ser Tyr Gln Tyr Gly Thr Thr Gln Thr Leu Thr Cys Thr Val Tyr 

435 440 445 








Ala Ile Pro Pro Pro His His Ile His Trp Tyr Trp Gln Leu Glu Glu 

450 455 460 

Glu Cys Ala Asn Glu Pro Ser Gln Ala Val Ser Val Thr Asn Pro Tyr 

465 470 475 480 

Pro Cys Glu Glu Trp Arg Ser Val Glu Asp Phe Gln Gly Gly Asn Lys 

485 490 495 

Ile Glu Val Asn Lys Asn Gln Phe Ala Leu Ile Glu Gly Lys Asn Lys 

500 505 510 

Thr Val Ser Thr Leu Val Ile Gln Ala Ala Asn Val Ser Ala Leu Tyr 

515 520 525 

Lys Cys Glu Ala Val Asn Lys Val Gly Arg Gly Glu Arg Val Ile Ser 

530 535 540 

Phe His Val Thr Arg Gly Pro Glu Ile Thr Leu Gln Pro Asp Met Gln 

545 550 555 560 

Pro Thr Glu Gln Glu Ser Val Ser Leu Trp Cys Thr Ala Asp Arg Ser 

565 570 575 

Thr Phe Glu Asn Leu Thr Trp Tyr Lys Leu Gly Pro Gln Pro Leu Pro 

580 585 590 






Ile His Val Gly Glu Leu Pro Thr Pro Val Cys Lys Asn Leu Asp Thr 

595 600 605 

Leu Trp Lys Leu Asn Ala Thr Met Phe Ser Asn Ser Thr Asn Asp Ile 

610 615 620 

Leu Ile Met Glu Leu Lys Asn Ala Ser Leu Gln Asp Gln Gly Asp Tyr 

625 630 635 640 

Val Cys Leu Ala Gln Asp Arg Lys Thr Lys Lys Arg His Cys Val Val 

645 650 655 

Arg Gln Leu Thr Val Leu Glu Arg Val Ala Pro Thr Ile Thr Gly Asn 

660 665 670 

Leu Glu Asn Gln Thr Thr Ser Ile Gly Glu Ser Ile Glu Val Ser Cys 

675 680 685 

Thr Ala Ser Gly Asn Pro Pro Pro Gln Ile Met Trp Phe Lys Asp Asn 

690 695 700 

Glu Thr Leu Val Glu Asp Ser Gly Ile Val Leu Lys Asp Gly Asn Arg 

705 710 715 720 

Asn Leu Thr Ile Arg Arg Val Arg Lys Glu Asp Glu Gly Leu Tyr Thr 

725 730 735 




Cys Gln Ala Cys Ser Val Leu Gly Cys Ala Lys Val Glu Ala Phe Phe 

740 745 750 

Ile Ile Glu Gly Ala Gln Glu Lys Thr Asn Leu Glu Ile Ile Ile Leu 

755 760 765 

Val Gly Thr Ala Val Ile Ala Met Phe Phe Trp Leu Leu Leu Val Ile 

770 775 780 

Ile Leu Arg Thr Val Lys Arg Ala Asn Gly Gly Glu Leu Lys Thr Gly 

785 790 795 800 

Tyr Leu Ser Ile Val Met Asp Pro Asp Glu Leu Pro Leu Asp Glu His 

805 810 815 

Cys Glu Arg Leu Pro Tyr Asp Ala Ser Lys Trp Glu Phe Pro Arg Asp 

820 825 830 

Arg Leu Lys Leu Gly Lys Pro Leu Gly Arg Gly Ala Phe Gly Gln Val 

835 840 845 

Ile Glu Ala Asp Ala Phe Gly Ile Asp Lys Thr Ala Thr Cys Arg Thr 

850 855 860 

Val Ala Val Lys Met Leu Lys Glu Gly Ala Thr His Ser Glu His Arg 

865 870 875 880 


Ala Leu Met Ser Glu Leu Lys Ile Leu Ile His Ile Gly His His Leu 

885 890 895 

Asn Val Val Asn Leu Leu Gly Ala Cys Thr Lys Pro Gly Gly Pro Leu 

900 905 910 

Met Val Ile Val Glu Phe Cys Lys Phe Gly Asn Leu Ser Thr Tyr Leu 

915 920 925 

Arg Ser Lys Arg Asn Glu Phe Val Pro Tyr Lys Thr Lys Gly Ala Arg 

930 935 940 

Phe Arg Gln Gly Lys Asp Tyr Val Gly Ala Ile Pro Val Asp Leu Lys 

945 950 955 960 

Arg Arg Leu Asp Ser Ile Thr Ser Ser Gln Ser Ser Ala Ser Ser Gly 

965 970 975 

Phe Val Glu Glu Lys Ser Leu Ser Asp Val Glu Glu Glu Glu Ala Pro 

980 985 990 

Glu Asp Leu Tyr Lys Asp Phe Leu Thr Leu Glu His Leu Ile Cys Tyr 

995 1000 1005 



Ser Phe Gln Val Ala Lys Gly Met Glu Phe Leu Ala Ser Arg Lys 

1010 1015 1020 

Cys Ile His Arg Asp Leu Ala Ala Arg Asn Ile Leu Leu Ser Glu 

1025 1030 1035 

Lys Asn Val Val Lys Ile Cys Asp Phe Gly Leu Ala Arg Asp Ile 

1040 1045 1050 

Tyr Lys Asp Pro Asp Tyr Val Arg Lys Gly Asp Ala Arg Leu Pro 

1055 1060 1065 

Leu Lys Trp Met Ala Pro Glu Thr Ile Phe Asp Arg Val Tyr Thr 

1070 1075 1080 

Ile Gln Ser Asp Val Trp Ser Phe Gly Val Leu Leu Trp Glu Ile 

1085 1090 1095 

Phe Ser Leu Gly Ala Ser Pro Tyr Pro Gly Val Lys Ile Asp Glu 

1100 1105 1110 

Glu Phe Cys Arg Arg Leu Lys Glu Gly Thr Arg Met Arg Ala Pro 

1115 1120 1125 

Asp Tyr Thr Thr Pro Glu Met Tyr Gln Thr Met Leu Asp Cys Trp 

1130 1135 1140 

His Gly Glu Pro Ser Gln Arg Pro Thr Phe Ser Glu Leu Val Glu 

1145 1150 1155 


His Leu Gly Asn Leu Leu Gln Ala Asn Ala Gln Gln Asp Gly Lys 

1160 1165 1170 

Asp Tyr Ile Val Leu Pro Ile Ser Glu Thr Leu Ser Met Glu Glu 

1175 1180 1185 

Asp Ser Gly Leu Ser Leu Pro Thr Ser Pro Val Ser Cys Met Glu 

1190 1195 1200 

Glu Glu Glu Val Cys Asp Pro Lys Phe His Tyr Asp Asn Thr Ala 

1205 1210 1215 

Gly Ile Ser Gln Tyr Leu Gln Asn Ser Lys Arg Lys Ser Arg Pro 

1220 1225 1230 

Val Ser Val Lys Thr Phe Glu Asp Ile Pro Leu Glu Glu Pro Glu 

1235 1240 1245 

Val Lys Val Ile Pro Asp Asp Asn Gln Thr Asp Ser Gly Met Val 

1250 1255 1260 

Leu Ala Ser Glu Glu Leu Lys Thr Leu Glu Asp Arg Thr Lys Leu 

1265 1270 1275 

Ser Pro Ser Phe Gly Gly Met Val Pro Ser Lys Ser Arg Glu Ser 

1280 1285 1290 

Val Ala Ser Glu Gly Ser Asn Gln Thr Ser Gly Tyr Gln Ser Gly 

1295 1300 1305 

Tyr His Ser Asp Asp Thr Asp Thr Thr Val Tyr Ser Ser Glu Glu 

1310 1315 1320 

Ala Glu Leu Leu Lys Leu Ile Glu Ile Gly Val Gln Thr Gly Ser 

1325 1330 1335 

Thr Ala Gln Ile Leu Gln Pro Asp Ser Gly Thr Thr Leu Ser Ser 

1340 1345 1350 

Pro Pro Val 

1355 



<210> 116 

<211> 1186 

<212> PRT 

<213> Homo sapiens 

<400> 116 



Leu Glu Glu Lys Lys Val Cys Gln Gly Thr Ser Asn Lys Leu Thr Gln 

1 5 10 15 

Leu Gly Thr Phe Glu Asp His Phe Leu Ser Leu Gln Arg Met Phe Asn 

20 25 30 

Asn Cys Glu Val Val Leu Gly Asn Leu Glu Ile Thr Tyr Val Gln Arg 

35 40 45 

Asn Tyr Asp Leu Ser Phe Leu Lys Thr Ile Gln Glu Val Ala Gly Tyr 

50 55 60 

Val Leu Ile Ala Leu Asn Thr Val Glu Arg Ile Pro Leu Glu Asn Leu 

65 70 75 80 

Gln Ile Ile Arg Gly Asn Met Tyr Tyr Glu Asn Ser Tyr Ala Leu Ala 

85 90 95 

Val Leu Ser Asn Tyr Asp Ala Asn Lys Thr Gly Leu Lys Glu Leu Pro 

100 105 110 






Met Arg Asn Leu Gln Glu Ile Leu His Gly Ala Val Arg Phe Ser Asn 

115 120 125 

Asn Pro Ala Leu Cys Asn Val Glu Ser Ile Gln Trp Arg Asp Ile Val 

130 135 140 

Ser Ser Asp Phe Leu Ser Asn Met Ser Met Asp Phe Gln Asn His Leu 

145 150 155 160 

Gly Ser Cys Gln Lys Cys Asp Pro Ser Cys Pro Asn Gly Ser Cys Trp 

165 170 175 

Gly Ala Gly Glu Glu Asn Cys Gln Lys Leu Thr Lys Ile Ile Cys Ala 

180 185 190 






Gln Gln Cys Ser Gly Arg Cys Arg Gly Lys Ser Pro Ser Asp Cys Cys 

195 200 205 

His Asn Gln Cys Ala Ala Gly Cys Thr Gly Pro Arg Glu Ser Asp Cys 

210 215 220 

Leu Val Cys Arg Lys Phe Arg Asp Glu Ala Thr Cys Lys Asp Thr Cys 

225 230 235 240 

Pro Pro Leu Met Leu Tyr Asn Pro Thr Thr Tyr Gln Met Asp Val Asn 

245 250 255 

Pro Glu Gly Lys Tyr Ser Phe Gly Ala Thr Cys Val Lys Lys Cys Pro 

260 265 270 

Arg Asn Tyr Val Val Thr Asp His Gly Ser Cys Val Arg Ala Cys Gly 

275 280 285 

Ala Asp Ser Tyr Glu Met Glu Glu Asp Gly Val Arg Lys Cys Lys Lys 

290 295 300 

Cys Glu Gly Pro Cys Arg Lys Val Cys Asn Gly Ile Gly Ile Gly Glu 

305 310 315 320 

Phe Lys Asp Ser Leu Ser Ile Asn Ala Thr Asn Ile Lys His Phe Lys 

325 330 335 

Asn Cys Thr Ser Ile Ser Gly Asp Leu His Ile Leu Pro Val Ala Phe 

340 345 350 

Arg Gly Asp Ser Phe Thr His Thr Pro Pro Leu Asp Pro Gln Glu Leu 

355 360 365 

Asp Ile Leu Lys Thr Val Lys Glu Ile Thr Gly Phe Leu Leu Ile Gln 

370 375 380 

Ala Trp Pro Glu Asn Arg Thr Asp Leu His Ala Phe Glu Asn Leu Glu 

385 390 395 400 

Ile Ile Arg Gly Arg Thr Lys Gln His Gly Gln Phe Ser Leu Ala Val 

405 410 415 

Val Ser Leu Asn Ile Thr Ser Leu Gly Leu Arg Ser Leu Lys Glu Ile 

420 425 430 

Ser Asp Gly Asp Val Ile Ile Ser Gly Asn Lys Asn Leu Cys Tyr Ala 

435 440 445 

Asn Thr Ile Asn Trp Lys Lys Leu Phe Gly Thr Ser Gly Gln Lys Thr 

450 455 460 

Lys Ile Ile Ser Asn Arg Gly Glu Asn Ser Cys Lys Ala Thr Gly Gln 

465 470 475 480 

Val Cys His Ala Leu Cys Ser Pro Glu Gly Cys Trp Gly Pro Glu Pro 

485 490 495 

Arg Asp Cys Val Ser Cys Arg Asn Val Ser Arg Gly Arg Glu Cys Val 

500 505 510 

Asp Lys Cys Asn Leu Leu Glu Gly Glu Pro Arg Glu Phe Val Glu Asn 

515 520 525 

Ser Glu Cys Ile Gln Cys His Pro Glu Cys Leu Pro Gln Ala Met Asn 

530 535 540 

Ile Thr Cys Thr Gly Arg Gly Pro Asp Asn Cys Ile Gln Cys Ala His 

545 550 555 560 

Tyr Ile Asp Gly Pro His Cys Val Lys Thr Cys Pro Ala Gly Val Met 

565 570 575 

Gly Glu Asn Asn Thr Leu Val Trp Lys Tyr Ala Asp Ala Gly His Val 

580 585 590 

Cys His Leu Cys His Pro Asn Cys Thr Tyr Gly Cys Thr Gly Pro Gly 

595 600 605 

Leu Glu Gly Cys Pro Thr Asn Gly Pro Lys Ile Pro Ser Ile Ala Thr 

610 615 620 

Gly Met Val Gly Ala Leu Leu Leu Leu Leu Val Val Ala Leu Gly Ile 

625 630 635 640 

Gly Leu Phe Met Arg Arg Arg His Ile Val Arg Lys Arg Thr Leu Arg 

645 650 655 

Arg Leu Leu Gln Glu Arg Glu Leu Val Glu Pro Leu Thr Pro Ser Gly 

660 665 670 

Glu Ala Pro Asn Gln Ala Leu Leu Arg Ile Leu Lys Glu Thr Glu Phe 

675 680 685 

Lys Lys Ile Lys Val Leu Gly Ser Gly Ala Phe Gly Thr Val Tyr Lys 

690 695 700 

Gly Leu Trp Ile Pro Glu Gly Glu Lys Val Lys Ile Pro Val Ala Ile 

705 710 715 720 

Lys Glu Leu Arg Glu Ala Thr Ser Pro Lys Ala Asn Lys Glu Ile Leu 

725 730 735 

Asp Glu Ala Tyr Val Met Ala Ser Val Asp Asn Pro His Val Cys Arg 

740 745 750 

Leu Leu Gly Ile Cys Leu Thr Ser Thr Val Gln Leu Ile Thr Gln Leu 

755 760 765 

Met Pro Phe Gly Cys Leu Leu Asp Tyr Val Arg Glu His Lys Asp Asn 

770 775 780 

Ile Gly Ser Gln Tyr Leu Leu Asn Trp Cys Val Gln Ile Ala Lys Gly 

785 790 795 800 

Met Asn Tyr Leu Glu Asp Arg Arg Leu Val His Arg Asp Leu Ala Ala 

805 810 815 

Arg Asn Val Leu Val Lys Thr Pro Gln His Val Lys Ile Thr Asp Phe 

820 825 830 

Gly Leu Ala Lys Leu Leu Gly Ala Glu Glu Lys Glu Tyr His Ala Glu 

835 840 845 

Gly Gly Lys Val Pro Ile Lys Trp Met Ala Leu Glu Ser Ile Leu His 

850 855 860 

Arg Ile Tyr Thr His Gln Ser Asp Val Trp Ser Tyr Gly Val Thr Val 

865 870 875 880 

Trp Glu Leu Met Thr Phe Gly Ser Lys Pro Tyr Asp Gly Ile Pro Ala 

885 890 895 

Ser Glu Ile Ser Ser Ile Leu Glu Lys Gly Glu Arg Leu Pro Gln Pro 

900 905 910 

Pro Ile Cys Thr Ile Asp Val Tyr Met Ile Met Val Lys Cys Trp Met 

915 920 925 

Ile Asp Ala Asp Ser Arg Pro Lys Phe Arg Glu Leu Ile Ile Glu Phe 

930 935 940 

Ser Lys Met Ala Arg Asp Pro Gln Arg Tyr Leu Val Ile Gln Gly Asp 

945 950 955 960 

Glu Arg Met His Leu Pro Ser Pro Thr Asp Ser Asn Phe Tyr Arg Ala 

965 970 975 

Leu Met Asp Glu Glu Asp Met Asp Asp Val Val Asp Ala Asp Glu Tyr 

980 985 990 

Leu Ile Pro Gln Gln Gly Phe Phe Ser Ser Pro Ser Thr Ser Arg Thr 

995 1000 1005 

Pro Leu Leu Ser Ser Leu Ser Ala Thr Ser Asn Asn Ser Thr Val 

1010 1015 1020 

Ala Cys Ile Asp Arg Asn Gly Leu Gln Ser Cys Pro Ile Lys Glu 

1025 1030 1035 

Asp Ser Phe Leu Gln Arg Tyr Ser Ser Asp Pro Thr Gly Ala Leu 

1040 1045 1050 

Thr Glu Asp Ser Ile Asp Asp Thr Phe Leu Pro Val Pro Glu Tyr 

1055 1060 1065 

Ile Asn Gln Ser Val Pro Lys Arg Pro Ala Gly Ser Val Gln Asn 

1070 1075 1080 

Pro Val Tyr His Asn Gln Pro Leu Asn Pro Ala Pro Ser Arg Asp 

1085 1090 1095 

Pro His Tyr Gln Asp Pro His Ser Thr Ala Val Gly Asn Pro Glu 

1100 1105 1110 

Tyr Leu Asn Thr Val Gln Pro Thr Cys Val Asn Ser Thr Phe Asp 

1115 1120 1125 

Ser Pro Ala His Trp Ala Gln Lys Gly Ser His Gln Ile Ser Leu 

1130 1135 1140 

Asp Asn Pro Asp Tyr Gln Gln Asp Phe Phe Pro Lys Glu Ala Lys 

1145 1150 1155 

Pro Asn Gly Ile Phe Lys Gly Ser Thr Ala Glu Asn Ala Glu Tyr 

1160 1165 1170 

Leu Arg Val Ala Pro Gln Ser Ser Glu Phe Ile Gly Ala 

1175 1180 1185 

<210> 117 

<211> 422 

<212> PRT 

<213> Homo sapiens 



<400> 117 



Met Asp Val Leu Ser Pro Gly Gln Gly Asn Asn Thr Thr Ser Pro Pro 

1 5 10 15 

Ala Pro Phe Glu Thr Gly Gly Asn Thr Thr Gly Ile Ser Asp Val Thr 

20 25 30 

Val Ser Tyr Gln Val Ile Thr Ser Leu Leu Leu Gly Thr Leu Ile Phe 

35 40 45 

Cys Ala Val Leu Gly Asn Ala Cys Val Val Ala Ala Ile Ala Leu Glu 

50 55 60 

Arg Ser Leu Gln Asn Val Ala Asn Tyr Leu Ile Gly Ser Leu Ala Val 

65 70 75 80 

Thr Asp Leu Met Val Ser Val Leu Val Leu Pro Met Ala Ala Leu Tyr 

85 90 95 

Gln Val Leu Asn Lys Trp Thr Leu Gly Gln Val Thr Cys Asp Leu Phe 

100 105 110 

Ile Ala Leu Asp Val Leu Cys Cys Thr Ser Ser Ile Leu His Leu Cys 

115 120 125 

Ala Ile Ala Leu Asp Arg Tyr Trp Ala Ile Thr Asp Pro Ile Asp Tyr 

130 135 140 










Val 


Asn 


Lys 


Arg 


Thr 


Pro 


Arg 


Arg 


Ala 


Ala 


Ala 


Leu 


lie 


Ser 


Leu 


Thr 


145 










150 










155 










16 0 


Trp 


Leu 


lie 


Gly 


Phe 


Leu 


He 


Ser 


lie 


Pro 


Pro 


Met 


Leu 


Gly 


Trp 


Arg 










165 










170 










175 




Thr 


Pro 


Glu 


Asp 


Arg 


Ser 


Asp 


Pro 


Asp 


Ala 


Cys 


Thr 


He 


Ser 


Lys 


Asp 








180 










185 










190 






His 


Gly 


Tyr 


Thr 


lie 


Tyr 


Ser 


Thr 


Phe 


Gly Ala 


Phe 


Tyr 


He 


Pro 


Leu 






195 










200 










205 








Leu 


Leu 


Met 


Leu 


Val 


Leu 


Tyr 


Gly 


Arg 


He 


Phe 


Arg 


Ala 


Ala 


Arg 


Phe 




210 










215 










220 










Arg 


lie 


Arg 


Lys 


Thr 


Val 


Lys 


Lys 


Val 


Glu 


Lys 


Thr 


Gly Ala 


Asp 


Thr 


225 










230 










235 










240 


Arg 


His 


Gly 


Ala 


Ser 


Pro 


Ala 


Pro 


Gin 


Pro 


Lys 


Lys 


Ser 


Val 


Asn 


Gly 










245 










250 










255 




Glu 


Ser 


Gly 


Ser 


Arg 


Asn 


Trp 


Arg 


Leu 


Gly Val 


Glu 


Ser 


Lys 


Ala 


Gly 








260 










265 










270 






Gly Ala 


Leu 


Cys 


Ala 


Asn 


Gly Ala 


Val 


Arg 


Gin 


Gly Asp 


Asp 


Gly Ala 






275 










280 










285 








Ala 


Leu 


Glu 


Val 


He 


Glu 


Val 


His 


Arg 


Val 


Gly 


Asn 


Ser 


Lys 


Glu 


His 




290 










295 










300 










Leu 


Pro 


Leu 


Pro 


Ser 


Glu 


Ala 


Gly 


Pro 


Thr 


Pro 


Cys 


Ala 


Pro 


Ala 


Ser 


305 










310 










315 










320 


Phe 


Glu 


Arg 


Lys 


Asn 


Glu 


Arg 


Asn 


Ala 


Glu 


Ala 


Lys 


Arg 


Lys 


Met 


Ala 










325 










330 










335 




Leu 


Ala 


Arg 


Glu 


Arg 


Lys 


Thr 


Val 


Lys 


Thr 


Leu 


Gly 


He 


He 


Met 


Gly 








340 










345 










350 






Thr 


Phe 


lie 


Leu 


Cys 


Trp 


Leu 


Pro 


Phe 


Phe 


He 


Val 


Ala 


Leu 


Val 


Leu 






355 










360 










365 








Pro 


Phe 


Cys 


Glu 


Ser 


Ser 


Cys 


His 


Met 


Pro 


Thr 


Leu 


Leu 


Gly 


Ala 


He 




370 










375 










380 










lie 


Asn 


Trp 


Leu 


Gly Tyr 


Ser 


Asn 


Ser 


Leu 


Leu 


Asn 


Pro 


Val 


He 


Tyr 


385 










390 










395 










400 


Ala 


Tyr 


Phe 


Asn 


Lys 


Asp 


Phe 


Gin 


Asn 


Ala 


Phe 


Lys 


Lys 


He 


He 


Lys 










405 










410 










415 




Cys 


Lys 


Phe 


Cys 


Arg 


Gin 























420 

<210> 118

<211> 129 

<212> PRT 

<213> Homo sapiens 



<400> 118 

His Lys Cys Asp Ile Thr Leu Gln Glu Ile Ile Lys Thr Leu Asn Ser
1 5 10 15

Leu Thr Glu Gln Lys Thr Leu Cys Thr Glu Leu Thr Val Thr Asp Ile
20 25 30

Phe Ala Ala Ser Lys Asn Thr Thr Glu Lys Glu Thr Phe Cys Arg Ala
35 40 45

Ala Thr Val Leu Arg Gln Phe Tyr Ser His His Glu Lys Asp Thr Arg
50 55 60

Cys Leu Gly Ala Thr Ala Gln Gln Phe His Arg His Lys Gln Leu Ile
65 70 75 80

Arg Phe Leu Lys Arg Leu Asp Arg Asn Leu Trp Gly Leu Ala Gly Leu
85 90 95

Asn Ser Cys Pro Val Lys Glu Ala Asn Gln Ser Thr Leu Glu Asn Phe
100 105 110

Leu Glu Arg Leu Lys Thr Ile Met Arg Glu Lys Tyr Ser Lys Cys Ser
115 120 125

Ser



<210> 119 

<211> 113 

<212> PRT 

<213> Homo sapiens 



<400> 119 

Met Gly Pro Val Pro Pro Ser Thr Ala Leu Arg Glu Leu Ile Glu Glu
1 5 10 15

Leu Val Asn Ile Thr Gln Asn Gln Lys Ala Pro Leu Cys Asn Gly Ser
20 25 30

Met Val Trp Ser Ile Asn Leu Thr Ala Gly Met Tyr Cys Ala Ala Leu
35 40 45

Glu Ser Leu Ile Asn Val Ser Gly Cys Ser Ala Ile Glu Lys Thr Gln
50 55 60

Arg Met Leu Ser Gly Phe Cys Pro His Lys Val Ser Ala Gly Gln Phe
65 70 75 80

Ser Ser Leu His Val Arg Asp Thr Lys Ile Glu Val Ala Gln Phe Val
85 90 95

Lys Asp Leu Leu Leu His Leu Lys Lys Leu Phe Arg Glu Gly Arg Phe
100 105 110

Asn



<210> 120 

<211> 726 

<212> PRT 

<213> Homo sapiens 



<400> 120 

Val Thr Lys Leu Leu Pro Ala Leu Leu Leu Gln His Val Leu Leu His
1 5 10 15

Leu Leu Leu Leu Pro Ile Ala Ile Pro Tyr Ala Glu Gly Gln Arg Lys
20 25 30

Arg Arg Asn Thr Ile His Glu Phe Lys Lys Ser Ala Lys Thr Thr Leu
35 40 45

Ile Lys Ile Asp Pro Ala Leu Lys Ile Lys Thr Lys Lys Val Asn Thr
50 55 60

Ala Asp Gln Cys Ala Asn Arg Cys Thr Arg Asn Lys Gly Leu Pro Phe
65 70 75 80

Thr Cys Lys Ala Phe Val Phe Asp Lys Ala Arg Lys Gln Cys Leu Trp
85 90 95

Phe Pro Phe Asn Ser Met Ser Ser Gly Val Lys Lys Glu Phe Gly His
100 105 110

Glu Phe Asp Leu Tyr Glu Asn Lys Asp Tyr Ile Arg Asn Cys Ile Ile
115 120 125

Gly Lys Gly Arg Ser Tyr Lys Gly Thr Val Ser Ile Thr Lys Ser Gly
130 135 140

Ile Lys Cys Gln Pro Trp Ser Ser Met Ile Pro His Glu His Ser Phe
145 150 155 160

Leu Pro Ser Ser Tyr Arg Gly Lys Asp Leu Gln Glu Asn Tyr Cys Arg
165 170 175

Asn Pro Arg Gly Glu Glu Gly Gly Pro Trp Cys Phe Thr Ser Asn Pro
180 185 190

Glu Val Arg Tyr Glu Val Cys Asp Ile Pro Gln Cys Ser Glu Val Glu
195 200 205

Cys Met Thr Cys Asn Gly Glu Ser Tyr Arg Gly Leu Met Asp His Thr
210 215 220

Glu Ser Gly Lys Ile Cys Gln Arg Trp Asp His Gln Thr Pro His Arg
225 230 235 240

His Lys Phe Leu Pro Glu Arg Tyr Pro Asp Lys Gly Phe Asp Asp Asn
245 250 255

Tyr Cys Arg Asn Pro Asp Gly Gln Pro Arg Pro Trp Cys Tyr Thr Leu
260 265 270

Asp Pro His Thr Arg Trp Glu Tyr Cys Ala Ile Lys Thr Cys Ala Asp
275 280 285

Asn Thr Met Asn Asp Thr Asp Val Pro Leu Glu Thr Thr Glu Cys Ile
290 295 300

Gln Gly Gln Gly Glu Gly Tyr Arg Gly Thr Val Asn Thr Ile Trp Asn
305 310 315 320

Gly Ile Pro Cys Gln Arg Trp Asp Ser Gln Tyr Pro His Glu His Asp
325 330 335

Met Thr Pro Glu Asn Phe Lys Cys Lys Asp Leu Arg Glu Asn Tyr Cys
340 345 350

Arg Asn Pro Asp Gly Ser Glu Ser Pro Trp Cys Phe Thr Thr Asp Pro
355 360 365

Asn Ile Arg Val Gly Tyr Cys Ser Gln Ile Pro Asn Cys Asp Met Ser
370 375 380

His Gly Gln Asp Cys Tyr Arg Gly Asn Gly Lys Asn Tyr Met Gly Asn
385 390 395 400

Leu Ser Gln Thr Arg Ser Gly Leu Thr Cys Ser Met Trp Asp Lys Asn
405 410 415

Met Glu Asp Leu His Arg His Ile Phe Trp Glu Pro Asp Ala Ser Lys
420 425 430

Leu Asn Glu Asn Tyr Cys Arg Asn Pro Asp Asp Asp Ala His Gly Pro
435 440 445

Trp Cys Tyr Thr Gly Asn Pro Leu Ile Pro Trp Asp Tyr Cys Pro Ile
450 455 460

Ser Arg Cys Glu Gly Asp Thr Thr Pro Thr Ile Val Asn Leu Asp His
465 470 475 480

Pro Val Ile Ser Cys Ala Lys Thr Lys Gln Leu Arg Val Val Asn Gly
485 490 495

Ile Pro Thr Arg Thr Asn Ile Gly Trp Met Val Ser Leu Arg Tyr Arg
500 505 510

Asn Lys His Ile Cys Gly Gly Ser Leu Ile Lys Glu Ser Trp Val Leu
515 520 525

Thr Ala Arg Gln Cys Phe Pro Ser Arg Asp Leu Lys Asp Tyr Glu Ala
530 535 540

Trp Leu Gly Ile His Asp Val His Gly Arg Gly Asp Glu Lys Cys Lys
545 550 555 560

Gln Val Leu Asn Val Ser Gln Leu Val Tyr Gly Pro Glu Gly Ser Asp
565 570 575

Leu Val Leu Met Lys Leu Ala Arg Pro Ala Val Leu Asp Asp Phe Val
580 585 590

Ser Thr Ile Asp Leu Pro Asn Tyr Gly Cys Thr Ile Pro Glu Lys Thr
595 600 605

Ser Cys Ser Val Tyr Gly Trp Gly Tyr Thr Gly Leu Ile Asn Tyr Asp
610 615 620

Gly Leu Leu Arg Val Ala His Leu Tyr Ile Met Gly Asn Glu Lys Cys
625 630 635 640

Ser Gln His His Arg Gly Lys Val Thr Leu Asn Glu Ser Glu Ile Cys
645 650

Leu Leu Leu Phe Arg Val Ala Gly Leu Glu

Leu Gly Asp Leu Phe Pro Asn Leu Thr Val Ile Arg Gly Trp Lys
65 70 75 80

Leu Phe Tyr Asn Tyr Ala Leu Val Ile Phe Glu Met Thr Asn Leu Lys
85 90 95

Asp Ile Gly Leu Tyr Asn Leu Arg Asn Ile Thr Arg Gly Ala Ile Arg
100 105 110

Ile Glu Lys Asn Ala Asp Leu Cys Tyr Leu Ser Thr Val Asp Trp Ser
115 120

Leu Ile Leu Asp Ala Val Ser Asn Asn Tyr Ile Val Gly Asn Lys Pro
130 135 140

Pro Lys Glu Cys Gly Asp Leu Cys Pro Gly Thr Met Glu Glu Lys Pro
145 150 155 160

Met Cys Glu Lys Thr Thr Ile Asn Asn Glu Tyr Asn Tyr Arg Cys Trp
165 175

Thr Thr Asn Arg Cys Gln Lys Met Cys Pro Ser Thr Cys Gly Lys Arg
180 185

Ala Cys Thr Glu Asn Asn Glu Cys Cys His Pro Glu Cys Leu Gly Ser
195 200 205

Cys Ser Ala Pro Asp Asn Asp Thr Ala Cys Val Ala Cys Arg His Tyr
210 215 220

Tyr Tyr Ala Gly Val Cys Val Pro Ala Cys Pro Pro Asn Thr Tyr Arg
225 230 235 240

Phe Glu Gly Trp Arg Cys Val Asp Arg Asp Phe Cys Ala Asn Ile Leu
245 250 255

Ser Ala Glu Ser Ser Asp Ser Glu Gly Phe Val Ile His Asp Gly Glu
260 265 270

Cys Met Gln Glu Cys Pro Ser Gly Phe Ile Arg Asn Gly Ser Gln Ser
275 280 285

Met Tyr Cys Ile Pro Cys Glu Gly Pro Cys Pro Lys Val Cys Glu Glu
290 295 300

Glu Lys Lys Thr Lys Thr Ile Asp Ser Val Thr Ser Ala Gln Met Leu
305 310 315 320

Gln Gly Cys Thr Ile Phe Lys Gly Asn Leu Leu Ile Asn Ile Arg Arg
325 330 335

Gly Asn Asn Ile Ala Ser Glu Leu Glu Asn Phe Met Gly Leu Ile Glu
340 345 350

Val Val Thr Gly Tyr Val Lys Ile Arg His Ser His Ala Leu Val Ser
355 360 365

Leu Ser Phe Leu Lys Asn Leu Arg Leu Ile Leu Gly Glu Glu Gln Leu
370 375 380

Glu Gly Asn Tyr Ser Phe Tyr Val Leu Asp Asn Gln Asn Leu Gln Gln
385 390 395 400

Leu Trp Asp Trp Asp His Arg Asn Leu Thr Ile Lys Ala Gly Lys Met
405 410 415

Tyr Phe Ala Phe Asn Pro Lys Leu Cys Val Ser Glu Ile Tyr Arg Met
420 425 430

Glu Glu Val Thr Gly Thr Lys Gly Arg Gln Ser Lys Gly Asp Ile Asn
435 440 445

Thr Arg Asn Asn Gly Glu Arg Ala Ser Cys Glu
450 455



<210> 127 

<211> 146 

<212> PRT 

<213> Homo sapiens 



<400> 127

Val Pro Ile Gln Lys Val Gln Asp Asp Thr Lys Thr Leu Ile Lys Thr
1 5 10 15

Ile Val Thr Arg Ile Asn Asp Ile Ser His Thr Gln Ser Val Ser Ser
20 25 30

Lys Gln Lys Val Thr Gly Leu Asp Phe Ile Pro Gly Leu His Pro Ile
35 40 45

Leu Thr Leu Ser Lys Met Asp Gln Thr Leu Ala Val Tyr Gln Gln Ile
50 55 60

Leu Thr Ser Met Pro Ser Arg Asn Val Ile Gln Ile Ser Asn Asp Leu
65 70 75 80

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys
85 90 95

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly
100 105 110

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg
115 120 125

Leu Gln Gly Ser Leu Gln Asp Met Leu Trp Gln Leu Asp Leu Ser Pro
130 135 140

Gly Cys
145



<210> 128 

<211> 327 

<212> PRT 

<213> Homo sapiens 

<400> 128 

Lys Glu Ile Thr Asn Ala Leu Glu Thr Trp Gly Ala Leu Gly Gln Asp
1 5 10 15

Ile Asn Leu Asp Ile Pro Ser Phe Gln Met Ser Asp Asp Ile Asp Asp
20 25 30

Ile Lys Trp Glu Lys Thr Ser Asp Lys Lys Lys Ile Ala Gln Phe Arg
35 40 45

Lys Glu Lys Glu Thr Phe Lys Glu Lys Asp Thr Tyr Lys Leu Phe Lys
50 55 60

Asn Gly Thr Leu Lys Ile Lys His Leu Lys Thr Asp Asp Gln Asp Ile
65 70 75 80

Tyr Lys Val Ser Ile Tyr Asp Thr Lys Gly Lys Asn Val Leu Glu Lys
85 90 95

Ile Phe Asp Leu Lys Ile Gln Glu Arg Val Ser Lys Pro Lys Ile Ser
100 105 110

Trp Thr Cys Ile Asn Thr Thr Leu Thr Cys Glu Val Met Asn Gly Thr
115 120 125

Asp Pro Glu Leu Asn Leu Tyr Gln Asp Gly Lys His Leu Lys Leu Ser
130 135 140

Gln Arg Val Ile Thr His Lys Trp Thr Thr Ser Leu Ser Ala Lys Phe
145 150 155 160

Lys Cys Thr Ala Gly Asn Lys Val Ser Lys Glu Ser Ser Val Glu Pro
165 170 175

Val Ser Cys Pro Glu Lys Gly Leu Asp Ile Tyr Leu Ile Ile Gly Ile
180 185 190

Cys Gly Gly Gly Ser Leu Leu Met Val Phe Val Ala Leu Leu Val Phe
195 200 205

Tyr Ile Thr Lys Arg Lys Lys Gln Arg Ser Arg Arg Asn Asp Glu Glu
210 215 220

Leu Glu Thr Arg Ala His Arg Val Ala Thr Glu Glu Arg Gly Arg Lys
225 230 235 240

Pro Gln Gln Ile Pro Ala Ser Thr Pro Gln Asn Pro Ala Thr Ser Gln
245 250 255

His Pro Pro Pro Pro Pro Gly His Arg Ser Gln Ala Pro Ser His Arg
260 265 270

Pro Pro Pro Pro Gly His Arg Val Gln His Gln Pro Gln Lys Arg Pro
275 280 285

Pro Ala Pro Ser Gly Thr Gln Val His Gln Gln Lys Gly Pro Pro Leu
290 295 300

Pro Arg Pro Arg Val Gln Pro Lys Pro Pro His Gly Ala Ala Glu Asn
305 310 315 320

Ser Leu Ser Pro Ser Ser Asn
325



<210> 129 

<211> 433

<212> PRT 

<213> Homo sapiens 

<400> 129 

Lys Lys Val Val Leu Gly Lys Lys Gly Asp Thr Val Glu Leu Thr Cys
1 5 10 15

Thr Ala Ser Gln Lys Lys Ser Ile Gln Phe His Trp Lys Asn Ser Asn
20 25 30

Gln Ile Lys Ile Leu Gly Asn Gln Gly Ser Phe Leu Thr Lys Gly Pro
35 40 45

Ser Lys Leu Asn Asp Arg Ala Asp Ser Arg Arg Ser Leu Trp Asp Gln
50 55 60

Gly Asn Phe Pro Leu Ile Ile Lys Asn Leu Lys Ile Glu Asp Ser Asp
65 70 75 80

Thr Tyr Ile Cys Glu Val Glu Asp Gln Lys Glu Glu Val Gln Leu Leu
85 90 95

Val Phe Gly Leu Thr Ala Asn Ser Asp Thr His Leu Leu Gln Gly Gln
100 105 110

Ser Leu Thr Leu Thr Leu Glu Ser Pro Pro Gly Ser Ser Pro Ser Val
115 120 125

Gln Cys Arg Ser Pro Arg Gly Lys Asn Ile Gln Gly Gly Lys Thr Leu
130 135 140

Ser Val Ser Gln Leu Glu Leu Gln Asp Ser Gly Thr Trp Thr Cys Thr
145 150 155 160

Val Leu Gln Asn Gln Lys Lys Val Glu Phe Lys Ile Asp Ile Val Val
165 170 175

Leu Ala Phe Gln Lys Ala Ser Ser Ile Val Tyr Lys Lys Glu Gly Glu
180 185 190

Gln Val Glu Phe Ser Phe Pro Leu Ala Phe Thr Val Glu Lys Leu Thr
195 200 205

Gly Ser Gly Glu Leu Trp Trp Gln Ala Glu Arg Ala Ser Ser Ser Lys
210 215 220

Ser Trp Ile Thr Phe Asp Leu Lys Asn Lys Glu Val Ser Val Lys Arg
225 230 235 240

Val Thr Gln Asp Pro Lys Leu Gln Met Gly Lys Lys Leu Pro Leu His
245 250 255

Leu Thr Leu Pro Gln Ala Leu Pro Gln Tyr Ala Gly Ser Gly Asn Leu
260 265 270

Thr Leu Ala Leu Glu Ala Lys Thr Gly Lys Leu His Gln Glu Val Asn
275 280 285

Leu Val Val Met Arg Ala Thr Gln Leu Gln Lys Asn Leu Thr Cys Glu
290 295 300

Val Trp Gly Pro Thr Ser Pro Lys Leu Met Leu Ser Leu Lys Leu Glu
305 310 315 320

Asn Lys Glu Ala Lys Val Ser Lys Arg Glu Lys Ala Val Trp Val Leu
325 330 335

Asn Pro Glu Ala Gly Met Trp Gln Cys Leu Leu Ser Asp Ser Gly Gln
340 345 350

Val Leu Leu Glu Ser Asn Ile Lys Val Leu Pro Thr Trp Ser Thr Pro
355 360

Val Gln Pro Met Ala Leu Ile Val Leu Gly Gly Val Ala Gly Leu Leu
370 375 380

Leu Phe Ile Gly Leu Gly Ile Phe Phe Cys Val Arg Cys Arg His Arg
385 390 395 400

Arg Arg Gln Ala Glu Arg Met Ser Gln Ile Lys Arg Leu Leu Ser Glu
405 410 415

Lys Lys Thr Cys Gln Cys Pro His Arg Phe Gln Lys Thr Cys Ser Pro
420 425 430

Ile

































<210> 130

<211> 1145

<212> PRT

<213> Homo sapiens

<400> 130
























Tyr Asn Leu Asp Val Arg Gly Ala Arg Ser Phe Ser Pro Pro Arg Ala
1 5 10 15

Gly Arg His Phe Gly Tyr Arg Val Leu Gln Val Gly Asn Gly Val Ile
20 25 30

Val Gly Ala Pro Gly Glu Gly Asn Ser Thr Gly Ser Leu Tyr Gln Cys
35 40 45

Gln Ser Gly Thr Gly His Cys Leu Pro Val Thr Leu Arg Gly Ser Asn
50 55 60

Tyr Thr Ser Lys Tyr Leu Gly Met Thr Leu Ala Thr Asp Pro Thr Asp
65 70 75 80

Gly Ser Ile Leu Ala Cys Asp Pro Gly Leu Ser Arg Thr Cys Asp Gln
85 90 95

Asn Thr Tyr Leu Ser Gly Leu Cys Tyr Leu Phe Arg Gln Asn Leu Gln
100 105 110

Gly Pro Met Leu Gln Gly Arg Pro Gly Phe Gln Glu Cys Ile Lys Gly
115 120 125

Asn Val Asp Leu Val Phe Leu Phe Asp Gly Ser Met Ser Leu Gln Pro
130 135 140

Asp Glu Phe Gln Lys Ile Leu Asp Phe Met Lys Asp Val Met Lys Lys
145 150 155 160

Leu Ser Asn Thr Ser Tyr Gln Phe Ala Ala Val Gln Phe Ser Thr Ser
165 170 175

Tyr Lys Thr Glu Phe Asp Phe Ser Asp Tyr Val Lys Arg Lys Asp Pro
180 185 190

Asp Ala Leu Leu Lys His Val Lys His Met Leu Leu Leu Thr Asn Thr
195 200 205

Phe Gly Ala Ile Asn Tyr Val Ala Thr Glu Val Phe Arg Glu Glu Leu
210 215 220

Gly Ala Arg Pro Asp Ala Thr Lys Val Leu Ile Ile Ile Thr Asp Gly
225 230 235 240

Glu Ala Thr Asp Ser Gly Asn Ile Asp Ala Ala Lys Asp Ile Ile Arg
245 250 255

Tyr Ile Ile Gly Ile Gly Lys His Phe Gln Thr Lys Glu Ser Gln Glu
260 265 270

Thr Leu His Lys Phe Ala Ser Lys Pro Ala Ser Glu Phe Val Lys Ile
275 280 285

Leu Asp Thr Phe Glu Lys Leu Lys Asp Leu Phe Thr Glu Leu Gln Lys
290 295 300

Lys Ile Tyr Val Ile Glu Gly Thr Ser Lys Gln Asp Leu Thr Ser Phe
305 310 315 320

Asn Met Glu Leu Ser Ser Ser Gly Ile Ser Ala Asp Leu Ser Arg Gly
325 330 335

His Ala Val Val Gly Ala Val Gly Ala Lys Asp Trp Ala Gly Gly Phe
340 345 350

Leu Asp Leu Lys Ala Asp Leu Gln Asp Asp Thr Phe Ile Gly Asn Glu
355 360 365

Pro Leu Thr Pro Glu Val Arg Ala Gly Tyr Leu Gly Tyr Thr Val Thr
370 375 380

Trp Leu Pro Ser Arg Gln Lys Thr Ser Leu Leu Ala Ser Gly Ala Pro
385 390 395 400

Arg Tyr Gln His Met Gly Arg Val Leu Leu Phe Gln Glu Pro Gln Gly
405 410 415

Gly Gly His Trp Ser Gln Val Gln Thr Ile His Gly Thr Gln Ile Gly
420 425 430

Ser Tyr Phe Gly Gly Glu Leu Cys Gly Val Asp Val Asp Gln Asp Gly
435 440 445

Glu Thr Glu Leu Leu Leu Ile Gly Ala Pro Leu Phe Tyr Gly Glu Gln
450 455 460

Arg Gly Gly Arg Val Phe Ile Tyr Gln Arg Arg Gln Leu Gly Phe Glu
465 470 475 480

Glu Val Ser Glu Leu Gln Gly Asp Pro Gly Tyr Pro Leu Gly Arg Phe
485 490 495

Gly Glu Ala Ile Thr Ala Leu Thr Asp Ile Asn Gly Asp Gly Leu Val
500 505 510

Asp Val Ala Val Gly Ala Pro Leu Glu Glu Gln Gly Ala Val Tyr Ile
515 520 525

Phe Asn Gly Arg His Gly Gly Leu Ser Pro Gln Pro Ser Gln Arg Ile
530 535 540

Glu Gly Thr Gln Val Leu Ser Gly Ile Gln Trp Phe Gly Arg Ser Ile
545 550 555 560

His Gly Val Lys Asp Leu Glu Gly Asp Gly Leu Ala Asp Val Ala Val
565 570 575

Gly Ala Glu Ser Gln Met Ile Val Leu Ser Ser Arg Pro Val Val Asp
580 585 590

Met Val Thr Leu Met Ser Phe Ser Pro Ala Glu Ile Pro Val His Glu
595 600 605

Val Glu Cys Ser Tyr Ser Thr Ser Asn Lys Met Lys Glu Gly Val Asn
610 615 620

Ile Thr Ile Cys Phe Gln Ile Lys Ser Leu Tyr Pro Gln Phe Gln Gly
625 630 635 640

Arg Leu Val Ala Asn Leu Thr Tyr Thr Leu Gln Leu Asp Gly His Arg
645 650 655

Thr Arg Arg Arg Gly Leu Phe Pro Gly Gly Arg His Glu Leu Arg Arg
660 665 670

Asn Ile Ala Val Thr Thr Ser Met Ser Cys Thr Asp Phe Ser Phe His
675 680 685

Phe Pro Val Cys Val Gln Asp Leu Ile Ser Pro Ile Asn Val Ser Leu
690 695 700

Asn Phe Ser Leu Trp Glu Glu Glu Gly Thr Pro Arg Asp Gln Arg Ala
705 710 715 720

Gln Gly Lys Asp Ile Pro Pro Ile Leu Arg Pro Ser Leu His Ser Glu
725 730 735

Thr Trp Glu Ile Pro Phe Glu Lys Asn Cys Gly Glu Asp Lys Lys Cys
740 745 750

Glu Ala Asn Leu Arg Val Ser Phe Ser Pro Ala Arg Ser Arg Ala Leu
755 760 765

Arg Leu Thr Ala Phe Ala Ser Leu Ser Val Glu Leu Ser Leu Ser Asn
770 775 780

Leu Glu Glu Asp Ala Tyr Trp Val Gln Leu Asp Leu His Phe Pro Pro
785 790 795 800

Gly Leu Ser Phe Arg Lys Val Glu Met Leu Lys Pro His Ser Gln Ile
805 810 815

Pro Val Ser Cys Glu Glu Leu Pro Glu Glu Ser Arg Leu Leu Ser Arg
820 825 830

Ala Leu Ser Cys Asn Val Ser Ser Pro Ile Phe Lys Ala Gly His Ser
835 840 845

Val Ala Leu Gln Met Met Phe Asn Thr Leu Val Asn Ser Ser Trp Gly
850 855 860

Asp Ser Val Glu Leu His Ala Asn Val Thr Cys Asn Asn Glu Asp Ser
865 870 875 880

Asp Leu Leu Glu Asp Asn Ser Ala Thr Thr Ile Ile Pro Ile Leu Tyr
885 890 895

Pro Ile Asn Ile Leu Ile Gln Asp Gln Glu Asp Ser Thr Leu Tyr Val
900 905 910

Ser Phe Thr Pro Lys Gly Pro Lys Ile His Gln Val Lys His Met Tyr
915 920 925

Gln Val Arg Ile Gln Pro Ser Ile His Asp His Asn Ile Pro Thr Leu
930 935 940

Glu Ala Val Val Gly Val Pro Gln Pro Pro Ser Glu Gly Pro Ile Thr
945 950 955 960

His Gln Trp Ser Val Gln Met Glu Pro Pro Val Pro Cys His Tyr Glu
965 970 975

Asp Leu Glu Arg Leu Pro Asp Ala Ala Glu Pro Cys Leu Pro Gly Ala
980 985 990

Leu Phe Arg Cys Pro Val Val Phe Arg Gln Glu Ile Leu Val Gln Val
995 1000 1005

Ile Gly Thr Leu Glu Leu Val Gly Glu Ile Glu Ala Ser Ser Met
1010 1015 1020

Phe Ser Leu Cys Ser Ser Leu Ser Ile Ser Phe Asn Ser Ser Lys
1025 1030 1035

His Phe His Leu Tyr Gly Ser Asn Ala Ser Leu Ala Gln Val Val
1040 1045 1050

Met Lys Val Asp Val Val Tyr Glu Lys Gln Met Leu Tyr Leu Tyr
1055 1060 1065

Val Leu Ser Gly Ile Gly Gly Leu Leu Leu Leu Leu Leu Ile Phe
1070 1075 1080

Ile Val Leu Tyr Lys Val Gly Phe Phe Lys Arg Asn Leu Lys Glu
1085 1090 1095

Lys Met Glu Ala Gly Arg Gly Val Pro Asn Gly Ile Pro Ala Glu
1100 1105 1110

Asp Ser Glu Gln Leu Ala Ser Gly Gln Glu Ala Gly Asp Pro Gly
1115 1120 1125

Cys Leu Lys Pro Leu His Glu Lys Asp Ser Glu Ser Gly Gly Gly
1130 1135 1140

Lys Asp
1145



<210> 131 

<211> 660 

<212> PRT 

<213> Homo sapiens 



<400> 131 



Met 


Glu 


Ala 


Leu 


Met 


Ala 


Arg 


Gly 


1 








5 








Leu 


Cys 


Leu 


Leu 


Gly 


Cys 


Leu 


Leu 








20 










Pro 


He 


lie 


Lys 


Phe 


Pro 


Gly 


Asp 






35 










40 


Leu 


Ala 


Val 


Gin 


Tyr 


Leu 


Asn 


Thr 




50 










55 




Cys 


Asn 


Leu 


Phe 


Val 


Leu 


Lys 


Asp 


65 










70 






Phe 


Gly 


Leu 


Pro 


Gin 


Thr 


Gly 


Asp 










85 








Met 


Arg 


Lys 


Pro 


Arg 


Cys 


Gly 


Asn 








100 










Phe 


Pro 


Arg 


Lys 


Pro 


Lys 


Trp 


Asp 






115 










120 


lie 


Gly 


Tyr 


Thr 


Pro 


Asp 


Leu 


Asp 




130 










135 




Ala 


Arg 


Ala 


Phe 


Gin 


Val 


Trp 


Ser 


145 










150 






Arg 


He 


His 


Asp 


Gly 


Glu 


Ala 


Asp 










165 








Glu 


His 


Gly 


Asp 


Gly 


Tyr 


Pro 


Phe 








180 










His 


Ala 


Phe 


Ala 


Pro 


Gly 


Thr 


Gly 






195 










200 


Asp 


Asp 


Glu 


Leu 


Trp 


Thr 


Leu 


Gly 




210 










215 




Tyr 


Gly 


Asn 


Ala 


Asp 


Gly 


Glu 


Tyr 


225 










230 






Gly Lys 


Glu 


Tyr 


Asn 


Ser 


Cys 


Thr 










245 








Leu 


Trp 


Cys 


Ser 


Thr 


Thr 


Tyr 


Asn 








260 










Phe 


Cys 


Pro 


His 


Glu 


Ala 


Leu 


Phe 






275 










280 


Gin 


Pro 


Cys 


Lys 


Phe 


Pro 


Phe 


Arg 




290 










295 




Cys 


Thr 


Thr 


Glu 


Gly 


Arg 


Thr 


Asp 


305 










310 






Glu 


Asp 


Tyr 


Asp 


Arg 


Asp 


Lys 


Lys 










325 








Met 


Ser 


Thr 


Val 


Gly 


Gly 


Asn 


Ser 








340 










Phe 


Thr 


Phe 


Leu 


Gly 


Asn 


Lys 


Tyr 






3 55 










360 


Ser 


Asp 


Gly 


Lys 


Met 


Trp 


Cys 


Ala 




370 










375 




Arg 


Lys 


Trp 


Gly 


Phe 


Cys 


Pro 


Asp 


385 










390 







Ala 


Leu 


Thr 


Gly 


Pro 


Leu 


Arg 


Ala 




10 










15 




Ser 


His 


Ala 


Ala 


Ala 


Ala 


Pro 


Ser 


25 










30 






Val 


Ala 


Pro 


Lys 


Thr 


Asp 


Lys 


Glu 










45 








Phe 


Tyr 


Gly 


Cys 


Pro 


Lys 


Glu 


Ser 








60 










Thr 


Leu 


Lys 


Lys 


Met 


Gin 


Lys 


Phe 






75 










80 


Leu 


Asp 


Gin 


Asn 


Thr 


He 


Glu 


Thr 




90 










95 




Pro 


Asp 


Val 


Ala 


Asn 


Tyr 


Asn 


Phe 


105 










110 






Lys 


Asn 


Gin 


He 


Thr 


Tyr 


Arg 


He 










125 








Pro 


Glu 


Thr 


Val 


Asp 


Asp 


Ala 


Phe 








140 










Asp 


Val 


Thr 


Pro 


Leu 


Arg 


Phe 


Ser 






155 










160 


He 


Met 


He 


Asn 


Phe 


Gly 


Arg 


Trp 




170 










175 




Asp 


Gly 


Lys 


Asp 


Gly 


Leu 


Leu 


Ala 


185 










190 






Val 


Gly 


Gly 


Asp 


Ser 


His 


Phe 


Asp 










205 








Glu 


Gly 


Gin 


Val 


Val 


Arg 


Val 


Lys 








220 










Cys 


Lys 


Phe 


Pro 


Phe 


Leu 


Phe 


Asn 






235 










240 


Asp 


Thr 


Gly 


Arg 


Ser 


Asp 


Gly 


Phe 




250 










255 




Phe 


Glu 


Lys 


Asp 


Gly 


Lys 


Tyr 


Gly 


265 










270 






Thr 


Met 


Gly 


Gly 


Asn 


Ala 


Glu 


Gly 










285 








Phe 


Gin 


Gly 


Thr 


Ser 


Tyr 


Asp 


Ser 








300 










Gly 


Tyr 


Arg 


Trp 


Cys 


Gly 


Thr 


Thr 






315 










320 


Tyr 


Gly 


Phe 


Cys 


Pro 


Glu 


Thr 


Ala 




330 










335 




Glu 


Gly 


Ala 


Pro 


Cys 


Val 


Phe 


Pro 


345 










350 






Glu 


Ser 


Cys 


Thr 


Ser 


Ala 


Gly 


Arg 










365 








Thr 


Thr 


Ala 


Asn 


Tyr 


Asp 


Asp 


Asp 








380 










Gin 


Gly 


Tyr 


Ser 


Leu 


Phe 


Leu 


Val 






395 










400 
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Ala 


Ala 


His 


Glu 


Phe 


Gly His 


Ala 


Met 


Gly Leu Glu 


His 


Ser 


Gin 


Asp 










405 










410 










415 




Pro 


Gly 


Ala 


Leu 
420 


Met 


Ala 


Pro 


He 


Tyr 
425 


Thr 


Tyr 


Thr 


Lys 


Asn 
430 


Phe 


Arg 


Leu 


Ser 


Gin 
435 


Asp 


Asp 


lie 


Lys 


Gly 
440 


Ile


Gln


Glu 


Leu 


Tyr 
445 


Gly 


Ala 


Ser 


Pro 


Asp 
450 


Ile


Asp 


Leu 


Gly 


Thr 
455 


Gly 


Pro 


Thr 


Pro 


Thr 
460 


Leu 


Gly 


Pro 


Val 


Thr 


Pro 


Glu 


Ile


Cys 


Lys 


Gln


Asp 


Ile


Val 


Phe 


Asp 


Gly 


Ile


Ala 


Gln


465 










470 










475 










480 


Ile


Arg 


Gly 


Glu 


Ile
485 


Phe 


Phe 


Phe 


Lys 


Asp 
490 


Arg 


Phe 


Ile


Trp 


Arg 
495 


Thr 


Val 


Thr 


Pro 


Arg 
500 


Asp 


Lys 


Pro 


Met 


Gly 
505 


Pro 


Leu 


Leu 


Val 


Ala 
510 


Thr 


Phe 


Trp 


Pro 


Glu 
515 


Leu 


Pro 


Glu 


Lys 


Ile
520 


Asp 


Ala 


Val 


Tyr 


Glu 
525 


Ala 


Pro 


Gln


Glu 


Glu 


Lys 


Ala 


Val 


Phe 


Phe 


Ala 


Gly Asn Glu 


Tyr 


Trp 


Ile


Tyr 


Ser 




530 










535 










540 










Ala 


Ser 


Thr 


Leu 


Glu 


Arg 


Gly 


Tyr 


Pro 


Lys 


Pro 


Leu 


Thr 


Ser 


Leu 


Gly 


545 










550 










555 










560 


Leu 


Pro 


Pro 


Asp 


Val 
565 


Gln


Arg 


Val 


Asp 


Ala 
570 


Ala 


Phe 


Asn 


Trp 


Ser 
575 


Lys 


Asn 


Lys 


Lys 


Thr 


Tyr 


Ile


Phe 


Ala 


Gly Asp 


Lys 


Phe 


Trp 


Arg 


Tyr 


Asn 








580 










585 










590 






Glu 


Val 


Lys 
595 


Lys 


Lys 


Met 


Asp 


Pro 
600 


Gly 


Phe 


Pro 


Lys 


Leu 
605 


Ile


Ala 


Asp 


Ala 


Trp 
610 


Asn 


Ala 


Ile


Pro 


Asp 
615 


Asn 


Leu 


Asp 


Ala 


Val 
620 


Val 


Asp 


Leu 


Gln


Gly Gly 


Gly His 


Ser 


Tyr 


Phe 


Phe 


Lys


Gly Ala 


Tyr 


Tyr 


Leu 


Lys 


Leu 


625 










630 










635 










640 


Glu 


Asn 


Gln


Ser 


Leu 
645 


Lys 


Ser 


Val 


Lys 


Phe 
650 


Gly 


Ser 


Ile


Lys 


Ser 
655 


Asp 


Trp 


Leu 


Gly 


Cys 
660 



























<210> 132 

<211> 707 

<212> PRT 

<213> Homo sapiens 



<400> 132 



Met 


Ser 


Leu 


Trp 


Gln


Pro 


Leu 


Val 


Leu 


Val 


Leu 


Leu 


Val 


Leu 


Gly 


Cys 


1 








5 










10 










15 


Cys 


Phe 


Ala 


Ala 


Pro 


Arg 


Gln


Arg 


Gln


Ser 


Thr 


Leu 


Val 


Leu 


Phe 


Pro 








20 










25 










30 






Gly 


Asp 


Leu 


Arg 


Thr 


Asn 


Leu 


Thr 


Asp 


Arg 


Gln


Leu 


Ala 


Glu 


Glu 


Tyr 






35 










40 










45 








Leu 


Tyr 


Arg 


Tyr 


Gly 


Tyr 


Thr 


Arg 


Val 


Ala 


Glu 


Met 


Arg 


Gly Glu 


Ser 




50 










55 










60 










Lys 


Ser 


Leu Gly 


Pro 


Ala 


Leu 


Leu 


Leu 


Leu 


Gln


Lys 


Gln


Leu 


Ser 


Leu 


65 










70 










75 










80 


Pro 


Glu 


Thr 


Gly 


Glu 


Leu 


Asp 


Ser 


Ala 


Thr 


Leu 


Lys 


Ala 


Met 


Arg 


Thr 










85 










90 










95 




Pro 


Arg 


Cys 


Gly 


Val 


Pro 


Asp 


Leu 


Gly Arg 


Phe 


Gln


Thr 


Phe 


Glu 


Gly 








100 










105 










110 






Asp 


Leu 


Lys 


Trp 


His 


His 


His 


Asn 


Ile


Thr 


Tyr 


Trp 


Ile


Gln


Asn 


Tyr 






115 










120 










125 








Ser 


Glu 


Asp 


Leu 


Pro 


Arg 


Ala 


Val 


Ile


Asp 


Asp 


Ala 


Phe 


Ala 


Arg 


Ala 




130 










135 










140 










Phe 


Ala 


Leu 


Trp 


Ser 


Ala 


Val 


Thr 


Pro 


Leu 


Thr 


Phe 


Thr 


Arg 


Val 


Tyr 


145 










150 










155 










160 


Ser 


Arg 


Asp 


Ala 


Asp 


Ile


Val 


Ile


Gln


Phe 


Gly Val 


Ala 


Glu 


His 


Gly 
Asp


Gly 


Tyr 










180 


Pro 


Pro 


Gly 


Pro 






195 




Leu 


Trp 


Ser 


Leu 




210 






Ala 


Asp 


Gly 


Ala 


225 








Tyr 


Ser 


Ala 


Cys 


O Ci -L 


J. IlJL 


Thr 


Ala 








260 


Ser 


Glu 


Arg 


Leu 






275 




Gln


Phe 


Pro 


Phe 




290 






Asp 


Gly 


Arg 


Ser 


305 








Asp 


Arg 


Asp 


Lys 


Vdl 


ivie l. 


(jrJ_y 


Gly 








340 


Phe 


Leu 


Gly 


Lys 






355 




Gly 


Arg 


Leu 


Trp 




370 






Trp 


Gly 


Phe 


Cys 


385 








His 


Glu 


Phe 


Gly 


iii a. 


Leu 




Tyr 








420 


Lys 


Asp 


Asp 


Val 






435 




Pro 


Glu 


Pro 


Arg 




450 






Pro 


Thr 


Val 


Cys 


465 








Pro 


Thr 


Ala 


Gly 


Pro 


Thr 


7\ 1 a 
I\± ci 


oJLy 








500 


Asp 


Asp 


Ala 


Cys 






515 




Asn 


Gln


Leu 


Tyr 




530 






Gly 


Arg 


Gly 


Ser 


545 








Pro 


Ala 


Leu 


Pro 


Lys 


Lys 




Phe 








580 


Ala 


Ser 


Val 


Leu 






595 




Asp 


Val 


Ala 


Gln




610 






Leu 


Leu 


Phe 


Ser 


625 








Met 


Val 


Asp 


Pro 


Val 


Pro 


Leu 


Asp 



660 



Phe 


Asp 


Gly 


Lys 


Gly 




Gln


vjx y 








200 


Gly 


Lys 


Gly 


Val 






215 




Ala 


Cys 


His 


Phe 




230 






Thr 


Thr 


Asp 


Gly 


245 








Asn 


Tyr 


Asp 


Thr 


Tyr 


Thr 


Arg 


Asp 








280 


Ile


Phe 


Gln


Gly 






295 




Asp 


Gly 


Tyr 


Arg 




310 






Leu 


Phe 


Gly 


Phe 


325 








Asn 


Ser 


Ala 


Gly 


o±U 


Tyr 


Ser 


inr 








360 


Cys 


Ala 


Thr 


Thr 






375 




Pro 


Asp 


Gln


Gly 




390 






His 


Ala 


Leu 


Gly 


405 








Pro 


Met 


Tyr 


Arg 


Asn 


Gly 


Ile


Arg 








440 


Pro 


Pro 


Thr 


Thr 






455 




Pro 


Thr 


Gly 


Pro 




470 






Pro 


Thr 


Gly 


Pro 


485 








Pro 


Ser 


Thr 


Ala 


Asn


V d _L 


Asn 


± X c 








520 


Leu 


Phe 


Lys 


Asp 






535 




Arg 


Pro 


Gln


Gly 




550 






Arg 


Lys 


Leu 


Asp 


565 








Phe 


Phe 


Ser 


Gly 


Gly 


Pro 


Arg 


Arg 








600 


Val 


Thr 


Gly 


Ala 






615 




Gly 


Arg 


Arg 


Leu 




630 






Arg 


Ser 


Ala 


Ser 


645 








Thr 


His 


Asp 


Val 



85 



170 

Asp Gly Leu Leu 
185 

Asp Ala His Phe 

Val Val Pro Thr 
220 

Pro Phe Ile Phe
235 

Arg Ser Asp Gly 
250 

Asp Asp Arg Phe 
265 

Gly Asn Ala Asp 

Gln Ser Tyr Ser
300 

Trp Cys Ala Thr 
315 

Cys Pro Thr Arg 
330 

Glu Leu Cys Val 
345 

Cys Thr Ser Glu 

Ser Asn Phe Asp 
380 

Tyr Ser Leu Phe 
395 

Leu Asp His Ser 
410 

Phe Thr Glu Gly 
425 

His Leu Tyr Gly 

Thr Thr Pro Gln
460 

Pro Thr Val His 
475 

Pro Ser Ala Gly 
490 

Thr Thr Val Pro 
505 

Phe Asp Ala Ile

Gly Lys Tyr Trp 
540 

Pro Phe Leu Ile
555 

Ser Val Phe Glu 
570 

Arg Gln Val Trp
585 

Leu Asp Lys Leu 

Leu Arg Ser Gly 
620 

Trp Arg Phe Asp 
635 

Glu Val Asp Arg 
650 

Phe Gln Tyr Arg
665 







175 




Ala 


His 


Ala 


Phe 




190 






Asp 


Asp 


Asp 


Glu 


205 








Arg 


Phe 


Gly 


Asn 


Glu 


Gly 


Arg 


Ser 








240


Leu 


Pro 


Trp 


Cys 






255 




Gly 


Phe 


Cys 


Pro 




270 






Gly 


Lys 


Pro 


Cys 


285 








Ala 


Cys 


Thr 


Thr 


Thr 


Ala 


Asn 


Tyr 








320


Ala 


Asp 


Ser 


Thr 






335 




Phe 


Pro 


Phe 


Thr 




350 






Gly 


Arg 


Gly 


Asp 


365 








Ser 


Asp 


Lys 


Lys 


Leu 


Val 


Ala 


Ala 








400 


Ser 


Val 


Pro 


Glu 






415 




Pro 


Pro 


Leu 


His 




430 






Pro 


Arg 


Pro 


Glu 


445 








Pro 


Thr 


Ala 


Pro 


Pro 


Ser 


Glu 


Arg 








480


Pro 


Thr 


Gly 


Pro 






495 




Leu 


Ser 


Pro 


Val 




510 






Ala 


Glu 


Ile


Gly 


525 








Arg 


Phe 


Ser 


Glu 


Ala 


Asp 


Lys 


Trp 








560


Glu 


Pro 


Leu 


Ser 






575 




Val 


Tyr 


Thr 


Gly 




590 






Gly 


Leu 


Gly 


Ala 


605 








Arg 


Gly 


Lys 


Met 


Val 


Lys 


Ala 


Gln








640 


Met 


Phe 


Pro 


Gly 






655 




Glu 


Lys 


Ala 


Tyr 




670 
Phe Cys Gln Asp Arg
675 

Asn Gln Val Asp Gln

690 
Pro Glu Asp 
705 

<210> 133 
<211> 115 
<212> PRT 
<213> Homo sapiens 

<400> 133 



Ile


Pro 


Thr 


Glu 


Ile


Pro 


Thr 


Ser 


Ala 


Leu 


Val 


Lys 


Glu 


Thr 


Leu 


Ala 


1 








5 










10 










15 




Leu 


Leu 


Ser 


Thr 


His 


Arg 


Thr 


Leu 


Leu 


Ile


Ala 


Asn 


Glu 


Thr 


Leu 


Arg 








20 










25 










30 






Ile


Pro 


Val 


Pro 


Val 


His 


Lys 


Asn 


His 


Gln


Leu 


Cys 


Thr 


Glu 


Glu 


Ile






35 










40 










45 








Phe 


Gln


Gly 


Ile


Gly 


Thr 


Leu 


Glu 


Ser 


Gln


Thr 


Val 


Gln


Gly 


Gly 


Thr 




50 










55 










60 










Val 


Glu 


Arg 


Leu 


Phe 


Lys 


Asn 


Leu 


Ser 


Leu 


Ile


Lys 


Lys 


Tyr 


Ile


Asp 


65 










70 










75 










80 


Gly 


Gln


Lys 


Lys 


Lys 


Cys 


Gly Glu 


Glu 


Arg 


Arg 


Arg 


Val 


Asn 


Gln


Phe 










85 










90 










95 




Leu 


Asp 


Tyr 


Leu 


Gln


Glu 


Phe 


Leu 


Gly Val 


Met 


Asn 


Thr 


Glu 


Trp 


Ile








100 










105 










110 







Ile Glu Ser



115 



Phe Tyr Trp Arg Val Ser Ser Arg Ser Glu Leu 

680 685 
Val Gly Tyr Val Thr Tyr Asp Ile Leu Gln Cys
695 700 



<210> 134 

<211> 185 

<212> PRT 

<213> Homo sapiens 

<400> 134 



Ala 


Pro 


Val 


Pro 


Pro 


Gly 


Glu 


Asp 


Ser 


Lys 


Asp 


Val 


Ala 


Ala 


Pro 


His 


1 








5 










10 










15 




Arg 


Gln


Pro 


Leu 


Thr 


Ser 


Ser 


Glu 


Arg 


Ile


Asp 


Lys 


Gln


Ile


Arg 


Tyr 








20 










25 










30 






Ile


Leu 


Asp 


Gly 


Ile


Ser 


Ala 


Leu 


Arg 


Lys 


Glu 


Thr 


Cys 


Asn 


Lys 


Ser 






35 










40 










45 








Asn 


Met 


Cys 


Glu 


Ser 


Ser 


Lys 


Glu 


Ala 


Leu 


Ala 


Glu 


Asn 


Asn 


Leu 


Asn 




50 










55 










60 










Leu 


Pro 


Lys 


Met 


Ala 


Glu 


Lys 


Asp 


Gly 


Cys 


Phe 


Gln


Ser 


Gly 


Phe 


Asn 


65 










70 










75 










80 


Glu 


Glu 


Thr 


Cys 


Leu 


Val 


Lys 


Ile


Ile


Thr 


Gly 


Leu 


Leu 


Glu 


Phe 


Glu 










85 










90 










95 




Val 


Tyr 


Leu 


Glu 


Tyr 


Leu 


Gln


Asn 


Arg 


Phe 


Glu 


Ser 


Ser 


Glu 


Glu 


Gln








100 










105 










110 






Ala 


Arg 


Ala 


Val 


Gln


Met 


Ser 


Thr 


Lys 


Val 


Leu 


Ile


Gln


Phe 


Leu 


Gln






115 










120 










125 








Lys 


Lys 


Ala 


Lys 


Asn 


Leu 


Asp 


Ala 


Ile


Thr 


Thr 


Pro 


Asp 


Pro 


Thr 


Thr 




130 










135 










140 










Asn 


Ala 


Ser 


Leu 


Leu 


Thr 


Lys 


Leu 


Gln


Ala 


Gln


Asn 


Gln


Trp 


Leu 


Gln


145 










150 










155 










160 


Asp 


Met 


Thr 


Thr 


His 


Leu 


Ile


Leu 


Arg 


Ser 


Phe 


Lys 


Glu 


Phe 


Leu 


Gln










165 










170 










175 




Ser 


Ser 


Leu 


Arg 


Ala 


Leu 


Arg 


Gln


Met 

















180 185 



WO 2004/113522 PCT/EP2004/051173 


<210> 135 
<211> 160 
<212> PRT 
<213> Homo sapiens 



<400> 135 



Ser 


Pro 


Gly 


Gln


Gly 


Thr 


Gln


Ser 


Glu 


Asn 


Ser 


Cys 


Thr 


His 


Phe 


Pro 


1 








5 










10 










15 




Gly Asn 


Leu 


Pro 


Asn 


Met 


Leu 


Arg 


Asp 


Leu 


Arg 


Asp 


Ala 


Phe 


Ser 


Arg 








20 










25 










30 






Val 


Lys 


Thr 


Phe 


Phe 


Gln


Met 


Lys 


Asp 


Gln


Leu 


Asp 


Asn 


Leu 


Leu 


Leu 






35 










40 










45 








Lys 


Glu 


Ser 


Leu 


Leu 


Glu 


Asp 


Phe 


Lys 


Gly Tyr 


Leu 


Gly 


Cys 


Gln


Ala 




50 










55 










60 










Leu 


Ser 


Glu 


Met 


Ile


Gln


Phe 


Tyr 


Leu 


Glu 


Glu 


Val 


Met 


Pro 


Gln


Ala 


65 










70 










75 










80 


Glu 


Asn 


Gln


Asp 


Pro 


Asp 


Ile


Lys 


Ala 


His 


Val 


Asn 


Ser 


Leu 


Gly 


Glu 










85 










90 










95 




Asn 


Leu 


Lys 


Thr 


Leu 


Arg 


Leu 


Arg 


Leu 


Arg 


Arg 


Cys 


His 


Arg 


Phe


Leu 








100 










105 










110 






Pro 


Cys 


Glu 


Asn 


Lys 


Ser 


Lys 


Ala 


Val 


Glu 


Gln


Val 


Lys 


Asn 


Ala 


Phe 






115 










120 










125 








Asn 


Lys 


Leu 


Gln


Glu 


Lys 


Gly 


Ile


Tyr 


Lys 


Ala 


Met 


Ser 


Glu 


Phe 


Asp 




130 










135 










140 










Ile


Phe 


Ile


Asn 


Tyr 


Ile


Glu 


Ala 


Tyr 


Met 


Thr 


Met 


Lys 


Ile


Arg 


Asn 


145 










150 










155 










160 


<210> 136
<211> 472
<212> PRT
<213> Homo sapiens

<400> 136




























Glu 


Met 


Gly 


Thr 


Ala 


Asp 


Leu 


Gly 


Pro 


Ser 


Ser 


Val 


Pro 


Thr 


Pro 


Thr 


1 








5 










10 










15 




Asn 


Val 


Thr 


Ile


Glu 


Ser 


Tyr 


Asn 


Met 


Asn 


Pro 


Ile


Val 


Tyr 


Trp 


Glu 








20 










25 










30 






Tyr 


Gln


Ile


Met 


Pro 


Gln


Val 


Pro 


Val 


Phe 


Thr 


Val 


Glu 


Val 


Lys 


Asn 






35 










40 










45 








Tyr 


Gly 


Val 


Lys 


Asn 


Ser 


Glu 


Trp 


Ile


Asp 


Ala 


Cys 


Ile


Asn 


Ile


Ser 




50 










55 










60 










His 


His 


Tyr 


Cys 


Asn 


Ile


Ser 


Asp 


His 


Val 


Gly 


Asp 


Pro 


Ser 


Asn 


Ser 


65 










70 










75 










80 


Leu 


Trp 


Val 


Arg 


Val 


Lys 


Ala 


Arg 


Val 


Gly 


Gln


Lys 


Glu 


Ser 


Ala 


Tyr 










85 










90 










95 




Ala 


Lys 


Ser 


Glu 


Glu 


Phe 


Ala 


Val 


Cys 


Arg 


Asp 


Gly 


Lys 


Ile


Gly 


Pro 








100 










105 










110 






Pro 


Lys 


Leu 


Asp 


Ile


Arg 


Lys 


Glu 


Glu 


Lys 


Gln


Ile


Met 


Ile


Asp 


Ile






115 










120 










125 








Phe 


His 


Pro 


Ser 


Val 


Phe 


Val 


Asn 


Gly Asp 


Glu 


Gln


Glu 


Val 


Asp 


Tyr 




130 










135 










140 










Asp 


Pro 


Glu 


Thr 


Thr 


Cys 


Tyr 


Ile


Arg 


Val 


Tyr 


Asn 


Val 


Tyr 


Val 


Arg 


145 










150 










155 










160 


Met 


Asn 


Gly 


Ser 


Glu 


Ile


Gln


Tyr 


Lys 


Ile


Leu 


Thr 


Gln


Lys 


Glu 


Asp 










165 










170 










175 




Asp 


Cys 


Asp 


Glu 


Ile


Gln


Cys 


Gln


Leu 


Ala 


Ile


Pro 


Val 


Ser 


Ser 


Leu 








180 










185 










190 






Asn 


Ser 


Gln


Tyr 


Cys 


Val 


Ser 


Ala 


Glu 


Gly Val 


Leu 


His 


Val 


Trp 


Gly 






195 










200 










205 








Val 


Thr 


Thr 


Glu 


Lys 


Ser 


Lys 


Glu 


Val 


Cys 


Ile


Thr 


Ile


Phe 


Asn 


Ser 




210 










215 










220 










Ser 


Ile


Lys 


Gly 


Ser 


Leu 


Trp 


Ile


Pro 


Val 


Val 


Ala 


Ala 


Leu 


Leu 


Leu 
225 230 235 240

Phe Leu Val Leu Ser Leu Val Phe Ile Cys Phe Tyr Ile Lys Lys Ile

245 250 255

Asn Pro Leu Lys Glu Lys Ser Ile Ile Leu Pro Lys Ser Leu Ile Ser

260 265 270 

Val Val Arg Ser Ala Thr Leu Glu Thr Lys Pro Glu Ser Lys Tyr Val 

275 280 285 

Ser Leu Ile Thr Ser Tyr Gln Pro Phe Ser Leu Glu Lys Glu Val Val

290 295 300 

Cys Glu Glu Pro Leu Ser Pro Ala Thr Val Pro Gly Met His Thr Glu 
305 310 315 320 

Asp Asn Pro Gly Lys Val Glu His Thr Glu Glu Leu Ser Ser Ile Thr

325 330 335

Glu Val Val Thr Thr Glu Glu Asn Ile Pro Asp Val Val Pro Gly Ser

340 345 350

His Leu Thr Pro Ile Glu Arg Glu Ser Ser Ser Pro Leu Ser Ser Asn

355 360 365

Gln Ser Glu Pro Gly Ser Ile Ala Leu Asn Ser Tyr His Ser Arg Asn

370 375 380 

Cys Ser Glu Ser Asp His Ser Arg Asn Gly Phe Asp Thr Asp Ser Ser 
385 390 395 400 

Cys Leu Glu Ser His Ser Ser Leu Ser Asp Ser Glu Phe Pro Pro Asn 

405 410 415 

Asn Lys Gly Glu Ile Lys Thr Glu Gly Gln Glu Leu Ile Thr Val Ile

420 425 430 

Lys Ala Pro Thr Ser Phe Gly Tyr Asp Lys Pro His Val Leu Val Asp 

435 440 445 

Leu Leu Val Asp Asp Ser Gly Lys Glu Ser Leu Ile Gly Tyr Arg Pro

450 455 460 

Thr Glu Asp Ser Lys Glu Phe Ser 
465 470 



<210> 137 

<211> 143 

<212> PRT 

<213> Homo sapiens 



<400> 137 




























Gln


Asp 


Pro 


Tyr 


Val 


Lys 


Glu 


Ala 


Glu 


Asn 


Leu 


Lys 


Lys 


Tyr 


Phe 


Asn 


1 








5 










10 










15 




Ala 


Gly 


His 


Ser 


Asp 


Val 


Ala 


Asp 


Asn 


Gly 


Thr 


Leu 


Phe 


Leu 


Gly 


Ile








20 










25 










30 






Leu 


Lys 


Asn 


Trp 


Lys 


Glu 


Glu 


Ser 


Asp 


Arg 


Lys 


Ile


Met 


Gln


Ser 


Gln






35 










40 










45 








Ile


Val 


Ser 


Phe 


Tyr 


Phe 


Lys 


Leu 


Phe 


Lys 


Asn 


Phe 


Lys 


Asp 


Asp 


Gln




50 










55 










60 










Ser 


Ile


Gln


Lys 


Ser 


Val 


Glu 


Thr 


Ile


Lys 


Glu 


Asp 


Met 


Asn 


Val 


Lys 


65 










70 










75 










80 


Phe 


Phe 


Asn 


Ser 


Asn 


Lys 


Lys 


Lys 


Arg 


Asp 


Asp 


Phe 


Glu 


Lys 


Leu 


Thr 










85 










90 










95 




Asn 


Tyr 


Ser 


Val 


Thr 


Asp 


Leu 


Asn 


Val 


Gln


Arg 


Lys 


Ala 


Ile


His 


Glu 








100 










105 










110 






Leu 


Ile


Gln


Val 


Met 


Ala 


Glu 


Leu 


Ser 


Pro 


Ala 


Ala 


Lys 


Thr 


Gly 


Lys 






115 










120 










125 








Arg 


Lys 


Arg 


Ser 


Gln


Met 


Leu 


Phe 


Arg 


Gly Arg 


Arg 


Ala 


Ser 


Gln






130 










135 










140 











<210> 138 
<211> 143 
<212> PRT 
<213> Homo sapiens
<400> 138 



Met 


Glu 


Ser 


Pro 


Ser 


Ala 


Pro 


Pro 


His 


Arg 


Trp 


Cys 


Ile


Pro 


Trp 


Gln


1 








5 










10 










15 




Arg 


Leu 


Leu 


Leu 


Thr 


Ala 


Ser 


Leu 


Leu 


Thr 


Phe 


Trp 


Asn 


Pro 


Pro 


Thr 








20 










25 










30 






Thr 


Ala 


Lys 


Leu 


Thr 


Ile


Glu 


Ser 


Thr 


Pro 


Phe 


Asn 


Val 


Ala 


Glu 


Gly 






35 










40 










45 








Lys 


Glu 


Val 


Leu 


Leu 


Leu 


Val 


His 


Asn 


Leu 


Pro 


Gln


His 


Leu 


Phe 


Gly 




50 










55 










60 










Tyr 


Ser 


Trp 


Tyr 


Lys 


Gly 


Glu 


Arg 


Val 


Asp 


Gly Asn 


Arg 


Gln


Ile


Ile


65 










70 










75 










80 


Gly 


Tyr 


Val 


Ile


Gly 


Thr 


Gln


Gln


Ala 


Thr 


Pro 


Gly 


Pro 


Ala 


Tyr 


Ser 










85 










90 










95 




Gly 


Arg 


Glu 


Ile


Ile


Tyr 


Pro 


Asn 


Ala 


Ser 


Leu 


Leu 


Ile


Gln


Asn 


Ile








100 










105 










110 






Ile


Gln


Asn 


Asp 


Thr 


Gly 


Phe 


Tyr 


Thr 


Leu 


His 


Val 


Ile


Lys 


Ser 


Asp 






115 










120 










125 








Leu 


Val 


Asn 


Glu 


Glu 


Ala 


Thr 


Gly 


Gln


Phe 


Arg 


Val 


Tyr 


Arg 


Glu 






130 










135 










140 











<210> 139 

<211> 440 

<212> PRT 

<213> Homo sapiens 



<400> 139 



Glu 


Met 


Val 


Asp 


1 








Glu 


Met 


Thr 


Val 








20 


Thr 


Gly 


Ser 


Ser 






35 




His 


Arg 


Tyr 


Tyr 




50 






Lys 


Gly 


Val 


Tyr 


65 








Gly 


Thr 


Asp 


Leu 


Ala 


Asn 


Ile


Ala 








100 


Ser 


Asn 


Trp 


Glu 






115 




Pro 


Asp 


Asp 


Ser 




130 






His 


Val 


Pro 


Asn 


145 








Leu 


Asn 


Gln


Ser 


Gly 


Gly 


Ile


Asp 








180 


Ile


Arg 


Arg 


Glu 






195 




Asn 


Gly 


Gln


Asp 




210 






Ser 


Ile


Val 


Asp 


225 








Phe 


Glu 


Ala 


Ala 


Phe 


Pro 


Asp 


Gly 



Asn Leu Arg Gly 
5 

Gly Ser Pro Pro 

Asn Phe Ala Val 
40 

Gln Arg Gln Leu
55 

Val Pro Tyr Thr 
70 

Val Ser Ile Pro
85

Ala Ile Thr Glu

Gly Ile Leu Gly
120 

Leu Glu Pro Phe 
135 

Leu Phe Ser Leu 
150 

Glu Val Leu Ala 
165 

His Ser Leu Tyr 

Trp Tyr Tyr Glu 
200 

Leu Lys Met Asp 
215 

Ser Gly Thr Thr 
230 

Val Lys Ser Ile
245 

Phe Trp Leu Gly 



Lys 


Ser 


Gly Gln




10 






Gln


Thr 


Leu 


Asn 


25 








Gly 


Ala 


Ala 


Pro 


Ser 


Ser 


Thr 


Tyr 








60 


Gln


Gly 


Lys 


Trp 






75 




His 


Gly 


Pro 


Asn 




90 






Ser 


Asp 


Lys 


Phe 


105 








Leu 


Ala 


Tyr 


Ala 


Phe 


Asp 


Ser 


Leu 








140 


Gln


Leu 


Cys 


Gly 






155 




Ser 


Val 


Gly 


Gly 




170 






Thr 


Gly 


Ser 


Leu 


185 








Val 


Ile


Ile


Val 


Cys 


Lys 


Glu 


Tyr 








220 


Asn 


Leu 


Arg 


Leu 






235 




Lys 


Ala 


Ala 


Ser 




250 






Glu 


Gln


Leu 


Val 



265 



Gly Tyr Tyr Val 
15 

Ile Leu Val Asp
30 

His Pro Phe Leu 
45 

Arg Asp Leu Arg 

Glu Gly Glu Leu 
80 

Val Thr Val Arg 
95 

Phe Ile Asn Gly
110

Glu Ile Ala Arg
125

Val Lys Gln Thr

Ala Gly Phe Pro 
160 

Ser Met Ile Ile
175 

Trp Tyr Thr Pro 
190 

Arg Val Glu Ile
205 

Asn Tyr Asp Lys 

Pro Lys Lys Val 
240 

Ser Thr Glu Lys 
255 

Cys Trp Gln Ala
270 



Gly 


Thr 


Thr 
275 


Pro 


Trp 


Asn 


Ile


Phe 
280 


Pro 


Val 


Ile


Ser 


Leu 
285 


Tyr 


Leu 


Met 


Gly 


Glu 
290 


Val 


Thr 


Asn 


Gln


Ser 
295 


Phe 


Arg 


Ile


Thr 


Ile
300 


Leu 


Pro 


Gln


Gln


Tyr 


Leu 


Arg 


Pro 


Val 


Glu 


Asp 


Val 


Ala 


Thr 


Ser 


Gln


Asp 


Asp 


Cys 


Tyr 


305 










310 










315 










320 


Lys 


Phe 


Ala 


Ile


Ser 
325 


Gln


Ser 


Ser 


Thr 


Gly 
330 


Thr 


Val 


Met 


Gly 


Ala 
335 


Val 


Ile


Met 


Glu 


Gly 
340 


Phe 


Tyr 


Val 


Val 


Phe 
345 


Asp 


Arg 


Ala 


Arg 


Lys 
350 


Arg 


Ile


Gly 


Phe 


Ala 
355 


Val 


Ser 


Ala 


Cys 


His 
360 


Val 


His 


Asp 


Glu 


Phe 
365 


Arg 


Thr 


Ala 


Ala 


Val 
370 


Glu 


Gly 


Pro 


Phe 


Val 
375 


Thr 


Leu 


Asp 


Met 


Glu 
380 


Asp 


Cys 


Gly 


Tyr 


Asn 


Ile


Pro 


Gln


Thr 


Asp 


Glu 


Ser 


Thr 


Leu 


Met 


Thr 


Ile


Ala 


Tyr 


Val 


385 










390 










395 








400 


Met 


Ala 


Ala 


Ile


Cys 
405 


Ala 


Leu 


Phe 


Met 


Leu 
410 


Pro 


Leu 


Cys 


Leu 


Met 
415 


Val 


Cys 


Gln


Trp 


Arg 
420 


Cys 


Leu 


Arg 


Cys 


Leu 
425 


Arg 


Gln


Gln


His 


Asp 
430


Asp 


Phe 


Ala 


Asp 


Asp 
435 


Ile


Ser 


Leu 


Leu 


Lys 
440 



















<210> 140 

<211> 810 

<212> PRT 

<213> Homo sapiens 



<400> 140 



Met 


Glu 


His 


Lys 


Glu 


Val 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


Phe 


Leu 


Lys 


Ser 


1 








5 










10 










15 




Gly 


Gln


Gly 


Glu 


Pro 


Leu 


Asp 


Asp 


Tyr 


Val 


Asn 


Thr 


Gln


Gly 


Ala 


Ser 








20 










25 










30 






Leu 


Phe 


Ser 


Val 


Thr 


Lys 


Lys 


Gln


Leu 


Gly 


Ala 


Gly 


Ser 


Ile


Glu 


Glu 






35 










40 










45 








Cys 


Ala 


Ala 


Lys 


Cys 


Glu 


Glu 


Asp 


Glu 


Glu 


Phe 


Thr 


Cys 


Arg 


Ala 


Phe 




50 










55 










60 










Gln


Tyr 


His 


Ser 


Lys 


Glu 


Gln


Gln


Cys 


Val 


Ile


Met 


Ala 


Glu 


Asn 


Arg 


65 










70 










75 










80 


Lys 


Ser 


Ser 


Ile


Ile


Ile


Arg 


Met 


Arg 


Asp 


Val 


Val 


Leu 


Phe 


Glu 


Lys 










85 










90 










95 




Lys 


Val 


Tyr 


Leu 


Ser 


Glu 


Cys 


Lys 


Thr 


Gly Asn Gly 


Lys 


Asn 


Tyr 


Arg 








100 










105 










110 






Gly 


Thr 


Met 


Ser 


Lys 


Thr 


Lys 


Asn 


Gly 


Ile


Thr 


Cys 


Gln


Lys 


Trp 


Ser 






115 










120 










125 








Ser 


Thr 


Ser 


Pro 


His 


Arg 


Pro 


Arg 


Phe 


Ser 


Pro 


Ala 


Thr 


His 


Pro 


Ser 




130 










135 










140 










Glu 


Gly 


Leu 


Glu 


Glu 


Asn 


Tyr 


Cys 


Arg 


Asn 


Pro 


Asp 


Asn 


Asp 


Pro 


Gln


145 










150 










155 










160 


Gly 


Pro 


Trp 


Cys 


Tyr 


Thr 


Thr 


Asp 


Pro 


Glu 


Lys 


Arg 


Tyr 


Asp 


Tyr 


Cys 










165 










170 










175 




Asp 


Ile


Leu 


Glu 


Cys 


Glu 


Glu 


Glu 


Cys 


Met 


His 


Cys 


Ser 


Gly 


Glu 


Asn 








180 










185 










190 






Tyr 


Asp 


Gly 


Lys 


Ile


Ser 


Lys 


Thr 


Met 


Ser 


Gly 


Leu 


Glu 


Cys 


Gln


Ala 






195 










200 










205 








Trp 


Asp 


Ser 


Gln


Ser 


Pro 


His 


Ala 


His 


Gly 


Tyr 


Ile


Pro 


Ser 


Lys 


Phe 




210 










215 










220 










Pro 


Asn 


Lys 


Asn 


Leu 


Lys 


Lys 


Asn 


Tyr 


Cys 


Arg 


Asn 


Pro 


Asp 


Arg 


Glu 


225 










230 










235 










240 


Leu 


Arg 


Pro 


Trp 


Cys 


Phe 


Thr 


Thr 


Asp 


Pro 


Asn 


Lys 


Arg 


Trp 


Glu 


Leu 










245 










250 










255 




Cys 


Asp 


Ile


Pro 


Arg 


Cys 


Thr 


Thr 


Pro 


Pro 


Pro 


Ser 


Ser 


Gly 


Pro 


Thr 
Tyr Gln Cys Leu Lys Gly Thr Gly Glu Asn Tyr Arg Gly Asn Val Ala

275 280 285

Val Thr Val Ser Gly His Thr Cys Gln His Trp Ser Ala Gln Thr Pro

290 295 300 

His Thr His Asn Arg Thr Pro Glu Asn Phe Pro Cys Lys Asn Leu Asp 
305 310 315 320

Glu Asn Tyr Cys Arg Asn Pro Asp Gly Lys Arg Ala Pro Trp Cys His

325 330 335

Thr Thr Asn Ser Gln Val Arg Trp Glu Tyr Cys Lys Ile Pro Ser Cys

340 345 350 

Asp Ser Ser Pro Val Ser Thr Glu Gln Leu Ala Pro Thr Ala Pro Pro

355 360 365

Glu Leu Thr Pro Val Val Gln Asp Cys Tyr His Gly Asp Gly Gln Ser

370 375 380

Tyr Arg Gly Thr Ser Ser Thr Thr Thr Thr Gly Lys Lys Cys Gln Ser
385 390 395 400

Trp Ser Ser Met Thr Pro His Arg His Gln Lys Thr Pro Glu Asn Tyr

405 410 415 

Pro Asn Ala Gly Leu Thr Met Asn Tyr Cys Arg Asn Pro Asp Ala Asp 

420 425 430 

Lys Gly Pro Trp Cys Phe Thr Thr Asp Pro Ser Val Arg Trp Glu Tyr 

435 440 445 

Cys Asn Leu Lys Lys Cys Ser Gly Thr Glu Ala Ser Val Val Ala Pro 

450 455 460 

Pro Pro Val Val Leu Leu Pro Asp Val Glu Thr Pro Ser Glu Glu Asp 
465 470 475 480 

Cys Met Phe Gly Asn Gly Lys Gly Tyr Arg Gly Lys Arg Ala Thr Thr 

485 490 495 

Val Thr Gly Thr Pro Cys Gln Asp Trp Ala Ala Gln Glu Pro His Arg

500 505 510

His Ser Ile Phe Thr Pro Glu Thr Asn Pro Arg Ala Gly Leu Glu Lys

515 520 525

Asn Tyr Cys Arg Asn Pro Asp Gly Asp Val Gly Gly Pro Trp Cys Tyr 

530 535 540 

Thr Thr Asn Pro Arg Lys Leu Tyr Asp Tyr Cys Asp Val Pro Gln Cys
545 550 555 560

Ala Ala Pro Ser Phe Asp Cys Gly Lys Pro Gln Val Glu Pro Lys Lys

565 570 575 

Cys Pro Gly Arg Val Val Gly Gly Cys Val Ala His Pro His Ser Trp 

580 585 590 

Pro Trp Gln Val Ser Leu Arg Thr Arg Phe Gly Met His Phe Cys Gly

595 600 605

Gly Thr Leu Ile Ser Pro Glu Trp Val Leu Thr Ala Ala His Cys Leu

610 615 620

Glu Lys Ser Pro Arg Pro Ser Ser Tyr Lys Val Ile Leu Gly Ala His
625 630 635 640

Gln Glu Val Asn Leu Glu Pro His Val Gln Glu Ile Glu Val Ser Arg

645 650 655

Leu Phe Leu Glu Pro Thr Arg Lys Asp Ile Ala Leu Leu Lys Leu Ser

660 665 670 

Ser Pro Ala Val Ile Thr Asp Lys Val Ile Pro Ala Cys Leu Pro Ser

675 680 685

Pro Asn Tyr Val Val Ala Asp Arg Thr Glu Cys Phe Ile Thr Gly Trp

690 695 700

Gly Glu Thr Gln Gly Thr Phe Gly Ala Gly Leu Leu Lys Glu Ala Gln
705 710 715 720

Leu Pro Val Ile Glu Asn Lys Val Cys Asn Arg Tyr Glu Phe Leu Asn

725 730 735 

Gly Arg Val Gln Ser Thr Glu Leu Cys Ala Gly His Leu Ala Gly Gly

740 745 750

Thr Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu Val Cys Phe Glu

755 760 765

Lys Asp Lys Tyr Ile Leu Gln Gly Val Thr Ser Trp Gly Leu Gly Cys
770 775 780

Ala Arg Pro Asn Lys Pro Gly Val Tyr Val Arg Val Ser Arg Phe Val
785 790 795 800

Thr Trp Ile Glu Gly Val Met Arg Asn Asn
805 810 



<210> 141 

<211> 762 

<212> PRT 

<213> Homo sapiens 



<400> 141 

Gly Pro Asn Ile

1 

Leu Ala Val Ser 
20 

Leu Gly Ser Pro 
35 

Cys Ala Pro Glu 
50 

Glu Asp Arg Pro 
65 

Thr Gln Val Ser

Ser Lys Asn Phe 
100 

Asp Ile Tyr Tyr
115

Trp Ser Ile Gln
130 

Leu Thr Ser Asn 
145 

Val Ser Pro Tyr 

Cys Tyr Asp Met 
180 

Val Leu Thr Leu 
195 

Lys Gln Ser Val
210

Ala Ile Met Gln
225 

Asp Ala Ser His 

Ala Leu Asp Gly 
260 

Cys His Val Gly 
275 

Tyr Pro Ser Leu 
290 

Asn Leu Ile Phe
305 

Tyr Ser Glu Leu 

Ser Ser Asn Val 
340 

Ser Lys Val Glu 
355 

Ser Phe Asn Ala 
370 

Ser Cys Met Gly 
385



Cys Thr Thr Arg 
5 

Pro Met Cys Ala 

Arg Cys Asp Leu 
40 

Ser Ile Glu Phe
55 

Leu Ser Asp Lys 
70 

Pro Gln Arg Ile
85

Ser Ile Gln Val

Leu Met Asp Leu 
120 

Asn Leu Gly Thr 
135 

Leu Arg Ile Gly
150

Met Tyr Ile Ser
165 

Lys Thr Thr Cys 

Thr Asp Gln Val
200 

Ser Arg Asn Arg 
215 

Ala Thr Val Cys 
230 

Leu Leu Val Phe 
245 

Arg Leu Ala Gly 

Ser Asp Asn His 
280 

Gly Leu Met Thr 
295 

Ala Val Thr Glu 
310 

Ile Pro Gly Thr
325

Leu Gln Leu Ile

Leu Glu Val Arg 
360 

Thr Cys Leu Asn 
375 

Leu Lys Ile Gly
390 



Gly Val Ser Ser 
10 

Trp Cys Ser Asp 
25 

Lys Glu Asn Leu 

Pro Val Ser Glu 
60 

Gly Ser Gly Asp 
75 

Ala Leu Arg Leu 
90 

Arg Gln Val Glu
105 

Ser Tyr Ser Met 

Lys Leu Ala Thr 
140 

Phe Gly Ala Phe 
155 

Pro Pro Glu Ala 
170 

Leu Pro Met Phe 
185 

Thr Arg Phe Asn 

Asp Ala Pro Glu 
220 

Asp Glu Lys Ile
235 

Thr Thr Asp Ala 
250 

Ile Val Gln Pro
265 

Tyr Ser Ala Ser 

Glu Lys Leu Ser 
300 

Asn Val Val Asn 
315 

Thr Val Gly Val 
330 

Val Asp Ala Tyr 
345 

Asp Leu Pro Glu 

Asn Glu Val Ile
380 

Asp Thr Val Ser 
395 



Cys Gln Gln Cys
15 

Glu Ala Leu Pro 
30 

Leu Lys Asp Asn 
45 

Ala Arg Val Leu 

Ser Ser Gln Val
80 

Arg Pro Asp Asp 
95 

Asp Tyr Pro Val 
110 

Lys Asp Asp Leu 
125 

Gln Met Arg Lys

Val Asp Lys Pro
160 

Leu Glu Asn Pro 
175 

Gly Tyr Lys His 
190 

Glu Glu Val Lys 
205

Gly Gly Phe Asp 

Gly Trp Arg Asn 
240 

Lys Thr His Ile
255

Asn Asp Gly Gln
270 

Thr Thr Met Asp 
285

Gln Lys Asn Ile

Leu Tyr Gln Asn
320 

Leu Ser Met Asp 
335 

Gly Lys Ile Arg
350 

Glu Leu Ser Leu 
365 

Pro Gly Leu Lys 

Phe Ser Ile Glu
400
Ala


Lys 


Val 


Arg 


Gly 


Cys 


Pro 


Gln


Glu 


Lys 


Glu 


Lys 


Ser 


Phe 


Thr 


Ile










405 










410 










415 




Lys 


Pro 


Val 


Gly 


Phe 


Lys 


Asp 


Ser 


Leu 


Ile


Val 


Gln


Val 


Thr 


Phe 


Asp 








420 










425 










430 






Cys 


Asp 


Cys 


Ala 


Cys 


Gln


Ala 


Gln


Ala 


Glu 


Pro 


Asn 


Ser 


His 


Arg


Cys 






435 










440 










445 








Asn 


Asn 


Gly Asn 


Gly 


Thr 


Phe 


Glu 


Cys 


Gly 


Val 


Cys 


Arg 


Cys 


Gly 


Pro 




450 










455 










460 










Gly 


Trp 


Leu 


Gly 


Ser 


Gln


Cys 


Glu 


Cys 


Ser 


Glu 


Glu 


Asp 


Tyr 


Arg 


Pro 


465 










470 










475 










480 


Ser 


Gln


Gln


Asp 


Glu 


Cys 


Ser 


Pro 


Arg 


Glu 


Gly Gln


Pro 


Val 


Cys 


Ser 










485 










490 










495 




Gln


Arg 


Gly 


Glu 


Cys 


Leu 


Cys 


Gly 


Gln


Cys 


Val 


Cys 


His 


Ser 


Ser 


Asp 








500 










505 










510 






Phe 


Gly 


Lys 


Ile


Thr 


Gly Lys 


Tyr 


Cys 


Glu 


Cys 


Asp 


Asp 


Phe 


Ser 


Cys 






515 










520 










525 








Val 


Arg 


Tyr 


Lys 


Gly Glu 


Met 


Cys 


Ser 


Gly 


His 


Gly 


Gln


Cys 


Ser 


Cys 




530 










535 










540 










Gly Asp 


Cys 


Leu 


Cys 


Asp 


Ser 


Asp 


Trp 


Thr 


Gly Tyr 


Tyr 


Cys 


Asn 


Cys 


545 










550 










555 










560 


Thr 


Thr 


Arg 


Thr 


Asp 


Thr 


Cys 


Met 


Ser 


Ser 


Asn 


Gly 


Leu 


Leu 


Cys 


Ser 










565 










570 










575 




Gly Arg 


Gly 


Lys 


Cys 


Glu 


Cys 


Gly 


Ser 


Cys 


Val 


Cys 


Ile


Gln


Pro 


Gly 








580 










585 










590 






Ser 


Tyr 


Gly Asp 


Thr 


Cys 


Glu 


Lys 


Cys 


Pro 


Thr 


Cys 


Pro 


Asp 


Ala 


Cys 






595 










600 










605 








Thr 


Phe 


Lys 


Lys 


Glu 


Cys 


Val 


Glu 


Cys 


Lys 


Lys 


Phe 


Asp 


Arg 


Glu 


Pro 




610 










615 










620 










Tyr 


Met 


Thr 


Glu 


Asn 


Thr 


Cys 


Asn 


Arg 


Tyr 


Cys 


Arg 


Asp 


Glu 


Ile


Glu 


625 










630 










635 










640 


Ser 


Val 


Lys 


Glu 


Leu 


Lys 


Asp 


Thr 


Gly 


Lys 


Asp 


Ala 


Val 


Asn 


Cys 


Thr 










645 










650 










655 




Tyr 


Lys 


Asn 


Glu 


Asp 


Asp 


Cys 


Val 


Val 


Arg 


Phe 


Gln


Tyr 


Tyr 


Glu 


Asp 








660 










665 










670 






Ser 


Ser 


Gly 


Lys 


Ser 


Ile


Leu 


Tyr 


Val 


Val 


Glu 


Glu 


Pro 


Glu 


Cys 


Pro 






675 










680 










685 








Lys 


Gly 


Pro 


Asp 


Ile


Leu 


Val 


Val 


Leu 


Leu 


Ser 


Val 


Met 


Gly 


Ala 


Ile




690 










695 










700 










Leu 


Leu 


Ile


Gly Leu Ala Ala 


Leu 


Leu 


Ile


Trp 


Lys 


Leu 


Leu 


Ile


Thr 


705 










710 










715 










720 


Ile


His 


Asp 


Arg 


Lys 


Glu 


Phe 


Ala 


Lys 


Phe 


Glu 


Glu 


Glu 


Arg 


Ala 


Arg 










725 










730 










735 




Ala 


Lys 


Trp 


Asp 


Thr 


Ala 


Asn 


Asn 


Pro 


Leu 


Tyr 


Lys 


Glu 


Ala 


Thr 


Ser 








740 










745 










750 






Thr 


Phe 


Thr 


Asn 


Ile


Thr 


Tyr 


Arg 


Gly 


Thr 















755 760 



<210> 142 

<211> 505 

<212> PRT 

<213> Homo sapiens 



<400> 142 



Gln


Thr 


Ser 


Val 


Ser 


Pro 


Ser 


Lys 


Val 


Ile


Leu 


Pro 


Arg 


Gly 


Gly 


Ser 


1 








5 










10 










15 




Val 


Leu 


Val 


Thr 


Cys 


Ser 


Thr 


Ser 


Cys 


Asp 


Gln


Pro 


Lys 


Leu 


Leu 


Gly 








20 










25 










30 






Ile


Glu 


Thr 


Pro 


Leu 


Pro 


Lys 


Lys 


Glu 


Leu 


Leu 


Leu 


Pro 


Gly 


Asn 


Asn 






35 










40 










45 








Arg 


Lys 


Val 


Tyr 


Glu 


Leu 


Ser 


Asn 


Val 


Gln


Glu 


Asp 


Ser 


Gln


Pro 


Met 




50 










55 










60 










Cys 


Tyr 


Ser 


Asn 


Cys 


Pro 


Asp 


Gly 


Gln


Ser 


Thr 


Ala 


Lys 


Thr 


Phe 


Leu 
65










70 










75 










80 


Thr 


Val 


Tyr 


Trp 


Thr 


Pro 


Glu 


Arg 


Val 


Glu 


Leu 


Ala 


Pro 


Leu 


Pro 


Ser 










85 










90 










95 




Trp 


Gln


Pro 


Val 


Gly Lys 


Asn 


Leu 


Thr 


Leu 


Arg 


Cys 


Gln


Val 


Glu 


Gly 








100 










105 










110 






Gly Ala 


Pro 


Arg 


Ala 


Asn 


Leu 


Thr 


Val 


Val 


Leu 


Leu 


Arg 


Gly 


Glu 


Lys 






115 










120










125 








Glu 


Leu 


Lys 


Arg 


Glu 


Pro 


Ala 


Val 


Gly Glu 


Pro 


Ala 


Glu 


Val 


Thr 


Thr 




130










135 










140 










Thr 


Val 


Leu 


Val 


Arg 


Arg 


Asp 


His 


His 


Gly 


Ala 


Asn 


Phe 


Ser 


Cys 


Arg 


145 










150 










155 










160 


Thr 


Glu 


Leu 


Asp 


Leu 


Arg 


Pro 


Gln Gly


Leu 


Glu 


Leu 


Phe 


Glu 


Asn 


Thr 










165 










170 










175 




Ser 


Ala 


Pro 


Tyr 


Gln


Leu 


Gln


Thr 


Phe 


Val 


Leu 


Pro 


Ala 


Thr 


Pro 


Pro 








180 










185 










190 






Gln


Leu 


Val 


Ser 


Pro 


Arg 


Val 


Leu 


Glu 


Val 


Asp 


Thr 


Gln


Gly 


Thr 


Val 






195 










200 










205 








Val 


Cys 


Ser 


Leu 


Asp 


Gly 


Leu 


Phe 


Pro 


Val 


Ser 


Glu 


Ala 


Gln


Val 


His 




210 










215 










220 










Leu 


Ala 


Leu 


Gly Asp 


Gln


Arg 


Leu 


Asn 


Pro 


Thr 


Val 


Thr 


Tyr 


Gly Asn 


225 










230 










235 










240 


Asp 


Ser 


Phe 


Ser 


Ala 


Lys 


Ala 


Ser 


Val 


Ser 


Val 


Thr 


Ala 


Glu 


Asp 


Glu 










245 










250 










255 




Gly Thr 


Gln


Arg 


Leu 


Thr 


Cys


Ala 


Val 


Ile


Leu 


Gly Asn 


Gln


Ser 


Gln








260 










265 










270 






Glu 


Thr 


Leu 


Gln


Thr 


Val 


Thr 


Ile


Tyr 


Ser 


Phe 


Pro 


Ala 


Pro 


Asn 


Val 






275 










280 










285 








Ile


Leu 


Thr 


Lys 


Pro 


Glu 


Val 


Ser 


Glu 


Gly 


Thr 


Glu 


Val 


Thr 


Val 


Lys 




290 










295 










300 










Cys 


Glu 


Ala 


His 


Pro 


Arg 


Ala 


Lys 


Val 


Thr 


Leu 


Asn 


Gly Val 


Pro 


Ala 


305 










310 










315 










320 


Gln


Pro 


Leu 


Gly 


Pro 


Arg 


Ala 


Gln


Leu 


Leu 


Leu 


Lys 


Ala 


Thr 


Pro 


Glu 










325 










330 










335 




Asp 


Asn 


Gly 


Arg 


Ser 


Phe 


Ser 


Cys 


Ser 


Ala 


Thr 


Leu 


Glu 


Val 


Ala 


Gly 








340 










345 










350 






Gln


Leu 


Ile


His 


Lys 


Asn 


Gln


Thr 


Arg 


Glu 


Leu 


Arg 


Val 


Leu 


Tyr 


Gly 






355 










360










365 








Pro 


Arg 


Leu 


Asp 


Glu 


Arg 


Asp 


Cys 


Pro 


Gly 


Asn 


Trp 


Thr 


Trp 


Pro 


Glu 




370 










375 










380 










Asn 


Ser 


Gln


Gln


Thr 


Pro 


Met 


Cys 


Gln


Ala 


Trp 


Gly Asn 


Pro 


Leu 


Pro 


385 










390 










395 










400 


Glu 


Leu 


Lys 


Cys 


Leu 


Lys 


Asp 


Gly 


Thr 


Phe 


Pro 


Leu 


Pro 


Ile


Gly 


Glu 










405 










410 










415 




Ser 


Val 


Thr 


Val 


Thr 


Arg 


Asp 


Leu 


Glu 


Gly 


Thr 


Tyr 


Leu 


Cys 


Arg 


Ala 








420 










425 










430 






Arg 


Ser 


Thr 


Gln


Gly Glu 


Val 


Thr 


Arg 


Glu 


Val 


Thr 


Val 


Asn 


Val 


Leu 






435 










440 










445 








Ser 


Pro 


Arg 


Tyr 


Glu 


Ile


Val 


Ile


Ile


Thr 


Val 


Val 


Ala 


Ala 


Ala 


Val 




450 










455 










460 










Ile


Met 


Gly 


Thr 


Ala 


Gly 


Leu 


Ser 


Thr 


Tyr 


Leu 


Tyr 


Asn 


Arg 


Gln


Arg 


465 










470 










475 










480 


Lys 


Ile


Lys 


Lys 


Tyr 


Arg 


Leu 


Gln


Gln


Ala 


Gln


Lys 


Gly 


Thr 


Pro 


Met 










485 










490 










495 




Lys 


Pro 


Asn 


Thr 


Gln


Ala 


Thr 


Pro 


Pro 






















500 










505 
95

Met Ile Glu Thr Tyr Asn Gln Thr Ser Pro Arg Ser Ala Ala Thr Gly

1 5 10 15

Leu Pro Ile Ser Met Lys Ile Phe Met Tyr Leu Leu Thr Val Phe Leu

20 25 30

Ile Thr Gln Met Ile Gly Ser Ala Leu Phe Ala Val Tyr Leu His Arg

35 40 45

Arg Leu Asp Lys Ile Glu Asp Glu Arg Asn Leu His Glu Asp Phe Val

50 55 60

Phe Met Lys Thr Ile Gln Arg Cys Asn Thr Gly Glu Arg Ser Leu Ser
65 70 75 80

Leu Leu Asn Cys Glu Glu Ile Lys Ser Gln Phe Glu Gly Phe Val Lys

85 90 95

Asp Ile Met Leu Asn Lys Glu Glu Thr Lys Lys Glu Asn Ser Phe Glu

100 105 110

Met Gln Lys Gly Asp Gln Asn Pro Gln Ile Ala Ala His Val Ile Ser

115 120 125

Glu Ala Ser Ser Lys Thr Thr Ser Val Leu Gln Trp Ala Glu Lys Gly

130 135 140

Tyr Tyr Thr Met Ser Asn Asn Leu Val Thr Leu Glu Asn Gly Lys Gln
145 150 155 160

Leu Thr Val Lys Arg Gln Gly Leu Tyr Tyr Ile Tyr Ala Gln Val Thr

165 170 175

Phe Cys Ser Asn Arg Glu Ala Ser Ser Gln Ala Pro Phe Ile Ala Ser

180 185 190

Leu Cys Leu Lys Ser Pro Gly Arg Phe Glu Arg Ile Leu Leu Arg Ala

195 200 205

Ala Asn Thr His Ser Ser Ala Lys Pro Cys Gly Gln Gln Ser Ile His

210 215 220

Leu Gly Gly Val Phe Glu Leu Gln Pro Gly Ala Ser Val Phe Val Asn
225 230 235 240

Val Thr Asp Pro Ser Gln Val Ser His Gly Thr Gly Phe Thr Ser Phe

245 250 255

Gly Leu Leu Lys Leu
260 



<210> 144 

<211> 187 

<212> PRT 

<213> Homo sapiens 

<400> 144 

Ala Met His Val Ala Gln Pro Ala Val Val Leu Ala Ser Ser Arg Gly

1 5 10 15

Ile Ala Ser Phe Val Cys Glu Tyr Ala Ser Pro Gly Lys Ala Thr Glu

20 25 30

Val Arg Val Thr Val Leu Arg Gln Ala Asp Ser Gln Val Thr Glu Val

35 40 45

Cys Ala Ala Thr Tyr Met Met Gly Asn Glu Leu Thr Phe Leu Asp Asp

50 55 60

Ser Ile Cys Thr Gly Thr Ser Ser Gly Asn Gln Val Asn Leu Thr Ile
65 70 75 80

Gln Gly Leu Arg Ala Met Asp Thr Gly Leu Tyr Ile Cys Lys Val Glu

85 90 95

Leu Met Tyr Pro Pro Pro Tyr Tyr Leu Gly Ile Gly Asn Gly Thr Gln

100 105 110

Ile Tyr Val Ile Asp Pro Glu Pro Cys Pro Asp Ser Asp Phe Leu Leu

115 120 125

Trp Ile Leu Ala Ala Val Ser Ser Gly Leu Phe Phe Tyr Ser Phe Leu

130 135 140

Leu Thr Ala Val Ser Leu Ser Lys Met Leu Lys Lys Arg Ser Pro Leu
145 150 155 160

Thr Thr Gly Val Tyr Val Lys Met Pro Pro Thr Glu Pro Glu Cys Glu
165 170 175

Lys Gln Phe Gln Pro Tyr Phe Ile Pro Ile Asn
180 185 
<400> 145



Ile


Pro 


Pro 


His 


Val 


Gln


Lys 


Ser 


Val 


Asn 


Asn 


Asp 


Met 


Ile


Val 


Thr 


1 








5 










10 










15 




Asp 


Asn 


Asn 


Gly Ala 


Val 


Lys 


Phe 


Pro 


Gln


Leu 


Cys 


Lys 


Phe 


Cys 


Asp 








20 










25 










30 






Val 


Arg 


Phe 


Ser 


Thr 


Cys 


Asp 


Asn 


Gln


Lys 


Ser 


Cys 


Met 


Ser 


Asn 


Cys 






35 










40 










45 








Ser 


Ile


Thr 


Ser 


Ile


Cys 


Glu 


Lys 


Pro 


Gln


Glu 


Val 


Cys 


Val 


Ala 


Val 




50 










55 










60 










Trp 


Arg 


Lys 


Asn 


Asp 


Glu 


Asn 


Ile


Thr 


Leu 


Glu 


Thr 


Val 


Cys 


His 


Asp 


65 










70 










75 










80 


Pro 


Lys 


Leu 


Pro 


Tyr 


His 


Asp 


Phe 


Ile


Leu 


Glu 


Asp 


Ala 


Ala 


Ser 


Pro 










85 










90 










95 




Lys 


Cys 


Ile


Met 


Lys 


Glu 


Lys Lys


Lys 


Pro 


Gly Glu 


Thr 


Phe 


Phe 


Met 








100 










105 










110 






Cys 


Ser 


Cys 


Ser 


Ser 


Asp 


Glu 


Cys 


Asn 


Asp 


Asn 


Ile


Ile


Phe 


Ser 


Glu 






115 










120 










125 








Glu 


Tyr 


Asn 


Thr 


Ser 


Asn 


Pro 


Asp 


Leu 


Leu 


Leu 


Val 


Ile


Phe 


Gln


Val 




130










135 










140 










Thr 


Gly 


Ile


Ser 


Leu 


Leu 


Pro 


Pro 


Leu 


Gly 


Val 


Ala 


Ile


Ser 


Val 


Ile


145 










150 










155 










160 


Ile


Ile


Phe 


Tyr 


Cys 


Tyr 


Arg 


Val 


Asn 


Arg 


Gln


Gln


Lys 


Leu 


Ser 


Ser 










165 










170 










175 




Thr 


Trp 


Glu 


Thr 


Gly 


Lys 


Thr 


Arg 


Lys 


Leu 


Met 


Glu 


Phe 


Ser 


Glu 


His 








180 










185 










190 






Cys 


Ala 


Ile


Ile


Leu 


Glu 


Asp 


Asp 


Arg 


Ser 


Asp 


Ile


Ser 


Ser 


Thr 


Cys 






195 










200 










205 








Ala 


Asn 


Asn 


Ile


Asn 


His 


Asn 


Thr 


Glu 


Leu 


Leu 


Pro 


Ile


Glu 


Leu 


Asp 




210 










215 










220 








Thr 


Leu 


Val 


Gly Lys 


Gly Arg 


Phe 


Ala 


Glu 


Val 


Tyr 


Lys 


Ala 


Lys 


Leu 


225 










230 










235 










240 


Lys 


Gln


Asn 


Thr 


Ser 


Glu 


Gln


Phe 


Glu 


Thr 


Val 


Ala 


Val 


Lys 


Ile


Phe 










245 










250 










255 




Pro 


Tyr 


Glu 


Glu 


Tyr 


Ala 


Ser 


Trp 


Lys 


Thr 


Glu 


Lys 


Asp 


Ile


Phe 


Ser 








260 










265 










270 






Asp 


Ile


Asn 


Leu 


Lys 


His 


Glu 


Asn 


Ile


Leu 


Gln


Phe 


Leu 


Thr 


Ala 


Glu 






275 










280 










285 








Glu 


Arg 


Lys 


Thr 


Glu 


Leu 


Gly Lys 


Gln


Tyr 


Trp 


Leu 


Ile


Thr 


Ala 


Phe 




290 










295 










300 










His 


Ala 


Lys 


Gly Asn 


Leu 


Gln


Glu 


Tyr 


Leu 


Thr 


Arg 


His 


Val 


Ile


Ser 


305 










310 










315 










320 


Trp 


Glu 


Asp 


Leu 


Arg 


Lys 


Leu 




Ser 


Ser 


Leu 


Ala 


Arg 


Gly 


Ile


Ala 










325 










330 










335 




His 


Leu 


His 


Ser 


Asp 


His 


Thr 


Pro 


Cys 


Gly 


Arg 


Pro 


Lys 


Met 


Pro 


Ile








340 










345 










350 






Val 


His 


Arg 


Asp 


Leu 


Lys 


Ser 


Ser 


Asn 


Ile


Leu 


Val 


Lys 


Asn 


Asp 


Leu 






355 










360 










365 








Thr 


Cys 


Cys 


Leu 


Cys 


Asp 


Phe 


Gly 


Leu 


Ser 


Leu 


Arg 


Leu 


Asp 


Pro 


Thr 




370 










375 










380 










Leu 


Ser 


Val 


Asp 


Asp 


Leu 


Ala 


Asn 


Ser Gly 


Gln


Val 


Gly 


Thr 


Ala 


Arg 


385 










390 










395 










400 


Tyr 


Met 


Ala 


Pro 


Glu 


Val 


Leu 


Glu 


Ser 


Arg 


Met 


Asn 


Leu 


Glu 


Asn 


Ala 










405 










410 










415 
Glu


Ser 


Phe 


Lys 


Gln


Thr 


Asp 


Val 


Tyr 


Ser Met 


Ala 


Leu 


Val 


Leu 


Trp 








420 










425 








430 




Glu 


Met 


Thr 


Ser 


Arg 


Cys 


Asn 


Ala 


Val 


Gly Glu Val 


Lys 


Asp 


Tyr 


Glu 






435 










440 








445 








Pro 


Pro 


Phe 


Gly 


Ser 


Lys 


Val 


Arg 


Glu 


His Pro 


Cys 


Val 


Glu 


Ser 


Met 




450










455 








460 










Lys 


Asp 


Asn 


Val 


Leu 


Arg 


Asp 


Arg 


Gly Arg Pro 


Glu 


Ile


Pro 


Ser 


Phe 


465 










470 








475 










480 


Trp 


Leu 


Asn 


His 


Gln


Gly 


Ile


Gln


Met 


Val Cys 


Glu 


Thr 


Leu 


Thr 


Glu 










485 










490 








495 




Cys 


Trp 


Asp 


His 


Asp 


Pro 


Glu 


Ala 


Arg 


Leu Thr 


Ala 


Gln


Cys 


Val 


Ala 








500 










505 








510 






Glu 


Arg 


Phe 


Ser 


Glu 


Leu 


Glu 


His 


Leu 


Asp Arg 


Leu 


Ser 


Gly 


Arg 


Ser 






515 










520 








525 








Cys 


Ser 


Glu 


Glu 


Lys 


Ile


Pro 


Glu 


Asp 


Gly Ser 


Leu 


Asn 


Thr 


Thr 


Lys 




530 










535 








540 









<210> 146

<211> 358

<212> PRT

<213> Homo sapiens

<400> 146




Cys 


Glu 


Glu 


Pro 


1 








Lys 


Pro 


Tyr 


Tyr 








20 


Gly 


Tyr 


Phe 


Tyr 






35 




Asn 


His 


Thr 


Trp 




50 






Cys 


Pro 


Tyr 


Ile


65 








Gly 


Thr 


Tyr 


Glu 


Tyr 


Tyr 


Leu 


Ile








100 


Val 


Ala 


Ile


Trp 






115 




Thr 


Pro 


Pro 


Pro 




130 






Glu 


Val 


Phe 


Glu 


145 








Pro 


Gly 


Pro 


Asp 


Gly Asp 


Asn 


Ser 








180 


Lys 


Cys 


Arg 


Phe 






195 




Gly 


Lys 


Lys 


Phe 




210 






Gly 


Phe 


Tyr 


Leu 


225 








Thr 


Trp 


Asp 


Pro 


Ser 


Thr 


Lys 


Pro 








260 


Thr 


Lys 


Ser 


Pro 






275 




Pro 


Pro 


Val 


Ser 




290 






Leu 


Asp 


Ser 


Leu 



iens 



Pro Thr Phe Glu 
5 

Glu Ile Gly Glu

Ile Pro Pro Leu
40 

Leu Pro Val Ser 
55 

Arg Asp Pro Leu 
70 

Phe Gly Tyr Gln
85 

Gly Glu Glu Ile

Ser Gly Lys Pro 
120 

Lys Ile Lys Asn
135 

Tyr Leu Asp Ala 
150 

Pro Phe Ser Leu 
165 

Val Trp Ser Arg 

Pro Val Val Glu 
200 

Tyr Tyr Lys Ala 
215 

Asp Gly Ser Asp 
230 

Pro Val Pro Lys 
245 

Pro Ala Leu Ser 

Ala Ser Ser Ala 
280 